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ABSTRACT 

Future surveys of large-scale structure will be able to measure perturbations on the scale of the 
cosmological horizon, and so could potentially probe a number of novel relativistic effects that are 
negligibly small on sub-horizon scales. These effects leave distinctive signatures in the power spectra 
of clustering observables and, if measurable, would open a new window on relativistic cosmology. We 
quantify the size and detectability of the effects for the most relevant future large-scale structure 
experiments: spectroscopic and photometric galaxy redshift surveys, intensity mapping surveys of 
neutral hydrogen, and radio continuum surveys. Our forecasts show that next-generation experiments, 
reaching out to redshifts z ~ 4, will not be able to detect previously-undetected general-relativistic 
effects by using individual tracers of the density field, although the contribution of weak leasing 
magnification on large scales should be clearly detectable. We also perform a rigorous joint forecast 
for the detection of primordial non-Gaussianity through the excess power it produces in the clustering 
of biased tracers on large scales, finding that uncertainties of cr(/NL) "^1 — 2 should be achievable. 
We study the level of degeneracy of these large-scale effects with several tracer-dependent nuisance 
parameters, quantifying the minimal priors on the latter that are needed for an optimal measurement 
of the former. Finally, we discuss the systematic effects that must be mitigated to achieve this level 
of sensitivity, and some alternative approaches that should help to improve the constraints. The 
computational tools developed to carry out this study, which requires the full-sky computation of the 
theoretical angular power spectra for 0(100) redshift bins, as well as realistic models of the luminosity 
function, are publicly available at http://iiitensitymapping.physics.ox.ac.uk/codes.html. 


1. INTRODUGTION 

The current success of relativistic cosmology is primar¬ 
ily based on the use of observations of large-scale struc¬ 
ture to infer the properties of the Universe. The statis¬ 
tics of temperature and mass fluctuations, from maps 
of the cosmic microwave background (GMB) and galaxy 
surveys respectively, have been used to place remark¬ 
ably tight constraints on the key cosmological parame¬ 
ters (Planck Gollaboration 2015a), and we have now mea¬ 
sured the spectral index of primordial fluctuations, the 
Hubble constant, and the present-day densities of dark 
matter, baryons, and relativistic species to exquisite pre¬ 
cision. While many of these parameters can have similar 
effects on cosmological observations, by combining differ¬ 
ent observables at different times and length scales, we 
have been able to break the degeneracies between them. 

While a variety of upcoming surveys will certainly im¬ 


prove the existing cosmological constraints, they will also 
allow us to probe an altogether new regime of large- 
scale structure - perturbations that span the cosmologi¬ 
cal horizon. While such scales are routinely studied in the 
GMB, these measurements consist of a single two dimen¬ 
sional projection of the radiation density field and gravi¬ 
tational potentials at a fixed time. As such, the amount 
of information we can obtain from them is fundamentally 
limited by projection effects and cosmic variance. With 
three-dimensional maps of the matter density held, on 
the other hand, it should be possible to greatly rehne 
our measurements of horizon-scale perturbations and, in 
doing so, explore a variety of new relativistic effects in 
cosmology. 

As pointed out in Bonvin et al. (2006); Yoo et al. 
(2009); Yoo (2010); Bonvin & Durrer (2011); Ghallinor & 
Lewis (2011), relativistic effects come into play through 
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apparent distortions of the projected survey volume by 
leasing, the propagation of light through inhomogeneous 
potentials, and the large-scale effect of peculiar veloci¬ 
ties at the source. While they are strongly sub-dominant 
on scales of order 100/i“^Mpc, where current galaxy 
surveys are focused, they can substantially modify the 
power spectrum of (e.g.) the number density of galaxies 
from the usual Newtonian predictions on extremely large 
scales. 

The familiar Kaiser redshift-space distortion is a sub¬ 
horizon approximation to a general relativistic redshift- 
space distortion, with the post-Kaiser terms becoming 
non-negligible only around the horizon scale. Terms ac¬ 
counting for lensing, time delays, and the Sachs-Wolfe 
and integrated Sachs-Wolfe effects are also present. (For 
analysis of each of these effects in large-scale structure 
observables, see Bruni et al. 2012; Jeong et al. 2012; Yoo 
et al. 2012; Bertacca et al. 2012; Hall et al. 2013; Lom- 
briser et al. 2013; Yoo & Desjacques 2013; Raccanelli 
et al. 2014; Yoo & Seljak 2015). 

Most of these additional effects can safely be ignored 
in standard analyses of galaxy clustering, simply because 
they are negligible for current surveys of limited volume. 
A partial exception is the lensing term, which contributes 
to the observed overdensity (modulated by the magnifi¬ 
cation bias), and which has occasionally been incorpo¬ 
rated into clustering analysis - it can become significant 
on sub-horizon scales, but only at high redshift. The re¬ 
maining terms^ will be referred to as the “GR terms” for 
the purposes of this paper, which focuses on very large 
scale effects. We will define the GR terms more precisely 
in due course. The hope, then, is that the GR effects can 
be teased out of cosmological data on ultra-large scales, 
and used to test the standard relativistic model of cos¬ 
mology. 

Another effect that can come into play on large 
scales is a scale-dependent bias due to primordial non- 
Gaussianity (Dalai et al. 2008; Matarrese & Verde 2008). 
If the primordial fluctuations are non-Gaussian, there 
will be a coupling between short and long wavelength 
modes such that the clustering of galaxies with respect 
to the underlying density field is enhanced on large scales. 
Specifically, the galaxy bias gains a scale-dependence 
proportional to ^ where k is the wavenumber of 

the mode being observed. The scale on which this effect 
comes into play is again of the order the cosmological 
horizon if the parameter quantifying the non-Gaussianity 
is /nl 1- Hence, in addition to detecting relativistic 
effects in structure formation, measurements of horizon- 
scale modes can also be used to constrain the statistics 
of primordial fluctuations, and thus the mechanism that 
seeded structure in the early Universe. 

As we can see, the scientific returns from measuring 
horizon-size fluctuations are legion. In this paper, we 
take a comprehensive view of future observations, and 
attempt to quantify how well both relativistic effects and 
primordial non-Gaussianity can be constrained with up¬ 
coming surveys. To do so, we examine four different 
types of survey: spectroscopic and photometric galaxy 
redshift surveys, continuum surveys of radio galaxies, 

^ While excluded from our list of “GR terms”, we will also pay 
some attention to the lensing term, as it is a ‘hybrid’ term that is 
only observable via clustering in very high volume surveys. 


and intensity mapping surveys of neutral hydrogen (HI). 
Each of these techniques will probe different redshifts 
with different sensitivities, but all will (in principle) be 
able to access horizon scale modes in the next decade. 
The surveys will also measure different combinations of 
the relativistic corrections, and will be sensitive to differ¬ 
ent systematic effects. We will therefore pay close atten¬ 
tion to identifying the different obstacles for detecting 
large scale modes, and discussing possible methods for 
mitigating their effects. 

The paper is structured as follows. In Section 2 we 
present the various large-scale effects that we are tar¬ 
geting, focusing on the relativistic effects and primordial 
non-Gaussianity, and discuss how they might show up in 
galaxy surveys and intensity mapping experiments. We 
also discuss the nuisance parameters that determine the 
amplitudes of these effects, and their dependence on as- 
trophysical uncertainties. In Section 3, we briefly present 
the Fisher forecasting formalism we will be working with. 
Then, in Section 4, we systematically present the fore¬ 
casted uncertainties in these observables for the differ¬ 
ent types of survey, discussing the specific experiments 
that we consider, their sensitivity to ultra large-scale ob¬ 
servables, possible degeneracies with nuisance parame¬ 
ters and their main sources of sysematic uncertainties. 
The models used to describe the signal and noise for each 
experiment are summarized in Appendix B. Finally, in 
Section 5 we discuss our findings and draw conclusions 
for the prospects for detecting the ultra large-scale ef¬ 
fects. 

2. ULTRA LARGE-SGALE COSMOLOGIGAL 
OBSERVABLES 

2.1. Relativistic effects in large-scale structure 
observables 

The aim of this section is to compute all the terms that 
give rise to fluctuations in the number of light-emitting 
sources measured by an observer in a given redshift in¬ 
terval dz and observed solid angle dflo- The main re¬ 
sult from this calculation is the following: neglecting all 
perturbations in the trajectory of the photons emitted 
by these sources, the observed perturbation in the num¬ 
ber counts is simply given by the perturbation in the 
comoving number density of sources. The comoving 4- 
volume that we ascribe to the patch defined by dz and 
dHo depends on the direction and redshift of the photons 
received from these sources, however, and therefore any 
perturbation in their trajectory will induce additional 
contributions to the total fluctuation of source number 
counts. 

Redshift space distortions (RSDs) are a perfect exam¬ 
ple of one of these contributions, where the observed red- 
shifts are perturbed by the peculiar velocity of the source. 
While RSDs have been well understood for decades, sev¬ 
eral other terms have only recently been rigorously quan¬ 
tified (Ghallinor & Lewis 2011; Bonvin & Durrer 2011). 
These terms are relativistic in nature, and affect the clus¬ 
tering spectrum only on extremely large scales, approach¬ 
ing the horizon size. Since these relativistic effects have 
been thoroughly discussed in the literature, the aim of 
this section is not to provide a derivation from first prin¬ 
ciples, but rather to give some physical intuition for the 
interpretation of each of these terms, as well as to estab- 
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lish the notation that will be used in what follows. 


2.1.1. Relativistic lightcone effects: number counts 

Consider a set of sources with comoving number den¬ 
sity ns( 77 ,x) (as measured in their own rest frame), and 
4-velocity These sources emit photons with a wave 
vector = dx^/dX (A is an affine parameter of the pho¬ 
ton geodesic), and rest-frame energy k^u^. During an 
interval dX of the affine parameter, the photons cover a 
volume dAf. {k^ u^)dX, where dA^ is the invariant area of 
the wavefront corresponding to the observed solid angle 
dflo- Throughout, we have labelled quantities measured 
in the emitter’s and observer’s frames with subscripts e 
and o respectively. 

The total number count in a redshift interval dz cor¬ 
responding to dX is therefore 


dN dAf. 


( 1 ) 


Each of these terms is straightforward to compute in the 
absence of perturbations: 


= risiviz)), 


{kn O 


dz 


a-iviz)) 

H{r]{z)y 


Here r]{z) is the background conformal time at redshift 
z, r(z) is the background comoving angular diameter 
distance, a{ri{z)) = 1/(1 + z) is the scale factor, and 
H = d/a is the expansion rate. For the rest of this work 
we will assume a flat background cosmology, so that ra¬ 
dial (x) and angular distances are the same. 

In the presence of inhomogeneities, all of these quanti¬ 
ties are perturbed with respect to their background val¬ 
ues at redshift z, and in general we can write: 


77(z,h) = 7?(z) -f dry, 
ns{z,n) = ns{r]{z)) 1-f d„-f 
rl A 

^^a2(ry(z))x2(z) [l + 2dj, 
“ id(ry(z))^ + 11^’ 


9 Ini 


dfj 


-6r] 


( 2 ) 

( 3 ) 

( 4 ) 

( 5 ) 


where Sn is the perturbation to ris, and we have defined 
the perturbations to the conformal time, Srj, transverse 
distance, dj_, and radial distance, dy. 

One extra detail must be taken into account: not all 
sources are equally bright, and will generally be dis¬ 
tributed according to a particular luminosity function, 
71 ^( 77 , In L, x), which we define as the density of sources 
in a logarithmic interval of luminosity: 

d(# sources) 

"■ = dVdluL ^ 

Only sources with a flux (observed power per unit de¬ 
tector area) above a given detection cut, Fcut, will be 
detected. Flux and luminosity are related by an inverse- 
square law in angular distance, so perturbations to the 
angular diameter distance will affect the measured flux. 


Linearizing with respect to these perturbations gives 


F(z,h) = 


47 r(l -b z)4a2(ry(z))x2(^) 


[l-2dj. (7) 


At a given redshift and flux cut, we will only observe 
sources with luminosities above a threshold Lcut, related 
to Fcut by the previous equation. In order to take this 
into account, we must replace ris by the cumulative lu¬ 
minosity function, 

nOO 

Af \nL) = / (ilnF'ns(?7,x,lnL'), (8) 

JlnL 


so that Eq. (3) becomes 


Af (z, n, Fcut) 


AT 


1 + 5j\f + 





We have shortened our notation such that 


A/' = A/'(r 7 (z),> In Fcut(^, Fcut)) (9) 

(and likewise for h^), and have overlined (~) all quantities 
evaluated in the background. 

The full linear expression for the source number counts 
can finally be written as 

+ ( 10 ) 


where the perturbation is given by 


AAr(z,n) = 5 m + 


91nA7 

dr] 


Sr] + (5|| + 2i5_l 


1 


rw 


( 11 ) 


In order to simplify the notation, from now on we will re¬ 
fer to the observed background number of sources found 
per unit redshift and solid angle simply as N{z), i.e. 


iV(z)^A7^X^(z). (12) 

The terms dr], Jy, and <5j_ can be related to the met¬ 
ric, density, and velocity perturbations by solving the 
geodesic equation for photons in any gauge. In the con¬ 
formal Newtonian gauge, defined by the line element 


ds^ = —a^{r]) [(1 + 2ip) drf — (1 — 2(/)) JydxMxl] , (13) 
these perturbations read 


HSr] = — ijj + 
Sii = 


j (</' + y') dr] - 


1-^ 

772 


HSr] + ijj + Vr 


1 

H 


dib dvr 

dr] dr] 


=H5r] - 

X 


Srj -\- J {(p + tp) dr] 


1 


2 Jo XX 




(14) 

(15) 

(16) 
re (17) 

(18) 


where = n • is the radial peculiar velocity of the 
sources, Vq is the Laplacian on the unit sphere, and k 
is the lensing convergence. Note that we have denoted 
all partial derivatives with respect to conformal time as 
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drjb = b' (and H = a’/a), and that the operator d/dr] de¬ 
notes a total lightcone derivative along the unperturbed 
photon trajectory, 

I (19, 

where 770 is the age of the Universe. Likewise, all integrals 
shown in the equations above must be understood as 
lightcone integrals along the same trajectory. 


2.1.2. Relativistic lightcone effects: intensity mapping 

Besides source number counts, another promising ob¬ 
servational tool for studying large-scale structure is a 
technique known as intensity mapping. The technical 
details of this method are discussed in Section 4.1, but 
we will describe the relevant relativistic effects here (see 
also Hall et al. 2013). 

In intensity mapping, the observable used to trace 
the matter density is the intensity received from a line- 
emitting medium integrated over a patch of the sky (i.e. 
the total power measured in a frequency interval per unit 
detector area and observed solid angle). We assume that 
this line emission is caused by some well-defined transi¬ 
tion line, and can therefore be used to recover the redshift 
of the source by comparing the observed frequency with 
the known rest frame one. In the rest frame of a set of 
line-emitting sources, the emissivity is 


dEe 

dtedVed^tedVe 


ft . , ..X2 Pa 

-A2lUeg}{Ve) - 

2 


( 20 ) 


where A 21 is the Einstein coefficient for the transition, 
Pa is the comoving density of the emitting gas, rua is its 
atomic mass, X 2 is the (number) fraction of the gas in 
the excited state, and g^iy) is the line profile (normalized 
to unity when integrated over all frequencies). 

As shown in Section 2.1.1, the volume covered by the 
emitted photons in an affine parameter interval dX is 


dVe = dAf. (fc^ u'^)dX. (21) 

Assuming that no absorption or scattering of the emit¬ 
ted photons occurs, the emissivity can be related to the 
measured intensity by 


/(zz„,h) = 


dE„ 


dtodAodVlodv, 
hVoA2lX2P: 


2ma 


(p{ve)ikf,u'^)dX 


dAf.dXla dVf.dte 

dAnXln dVndtn 


The frequencies and time intervals in both frames are 
directly related through the redshift z, as are angles and 
invariant areas (from Etherington’s reciprocity relation), 


dtadvg ^ dAgd^le 1 

dtodvo ’ dAodXlo {1 + zY 


( 22 ) 


Using these relations, and assuming that observations 
will take place on frequency intervals IXvo much larger 
than the line width, we finally obtain the relation 


/(j7o,n) 


hA2lV2lX2 
2TOa(l -I- zY 


Pa {kf, UY) 


dX 
dz ’ 


(23) 


except for the factor of the angular diameter distance, 
dAa/dflo. This is because the observable in intensity 
mapping is not the total number of objects in a given 
patch of sky, but the combined emitted light from the 
same patch. Since luminosities and angular distances 
are affected in the same way by lightcone effects, these 
cancel exactly for intensity mapping. 

Expanding both pa and (fc^ u^) ^ to linear order, we 
can therefore compute all of the linear perturbations to 
the observed intensity mapping signal, 


/(i/,n) = /(i/) [1 -f A/] 


= 


1 + + 


9 In Pa 
dr} 


(5r7-b(5|| , 


(24) 


where 5a is the intrinsic perturbation of the emitting gas 
density. By comparing this with Eq. (11), we can see that 
the linear perturbation for intensity mapping is equiva¬ 
lent to the perturbation of the number counts for a popu¬ 
lation of sources with a particular form of the luminosity 
function, such that the number of sources observed above 
a given flux is proportional to the luminosity associated 
with that flux, 

Mm(> L) cx L. (25) 


The somewhat unfortunate consequence of this result is 
that there are no linear perturbations to angular dis¬ 
tances for intensity mapping,^ which could potentially 
reduce the amount of cosmological information that can 
be extracted from this probe. 

The background term in Eq. (24) is commonly ex¬ 
pressed in terms of antenna temperature, defined through 
the Rayleigh-Jeans relation for a black-body emitter 
T{v) = I{v) Y/{2kBv'^), where fcs is the Boltzmann con¬ 
stant. In terms of background quantities, the homoge¬ 
neous intensity mapping signal is 

- _ ZhA2lX2Y Hlnb^oXa{z){l+zY 

?i2'KGkBmai'2i dl{z) ’ 


where Xa(z) is the fraction of baryons made up by the 
line-emitting species under study. 


2.1.3. Clustering bias 

Until now, we have not related the intrinsic perturba¬ 
tion in the number density of sources to the perturbations 
of the energy-momentum tensor. Assuming that galaxies 
form in dark matter haloes, which themselves form pref¬ 
erentially in high density regions, one would expect the 
halo (or galaxy) number overdensity to trace the fluc¬ 
tuations in the overall matter density on large (linear) 
scales with a simple linear bias factor, Jhaio — &haio<^M- 
This bias is a central piece of the halo model of structure 
formation (Mo & White 1996; Peacock & Smith 2000), 
the validity of which has been extensively tested against 
numerical simulations (Cole et al. 2008). Although the 
linear bias is expected to be scale-dependent on non¬ 
linear and mildly non-linear scales, where non-linear and 
stochastic bias terms could also be important, we are 
mainly interested in large-scale observables in this paper, 
where the approximation of a linear, scale-independent 
bias should be valid (although see Section 2.2). This 


where 1^21 is the rest-frame line frequency. We can see ^ Note that this is equivalent to the result that leasing of the 

that this is equivalent to Eq. (1) for number counts, CMB is a second-order effect. 
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bias will depend on redshift and luminosity, however (e.g. 
more luminous, and therefore rarer, objects are expected 
to be more highly biased). 

Since it is not possible to unambiguously define the 
matter overdensity Sm in a gauge-invariant way in a 
general-relativistic context, a subtle point is the choice 
of overdensity field on which the bias relation is applied. 
In this work we take the point of view of Challinor & 
Lewis (2011); Baldauf et al. (2011); Jeong et al. (2012); 
Bruni et al. (2012), and argue that, since the process of 
galaxy formation is due to local physics, and since we ex¬ 
pect our sources to follow the same velocity field as the 
dark matter, the bias relation should be applied in the 
synchronous comoving gauge. Note that it is also the co¬ 
moving gauge perturbation that appears in the Poisson 
equation. A more complete discussion of this argument, 
which can also be extended to the case of primordial 
non-Gaussianity, can be found in Baldauf et al. (2011). 

It follows that the intrinsic perturbation to the number 
density of sources, 6n, in the Newtonian gauge - our 
choice for this work - is related to the matter overdensity 
in the synchronous comoving gauge, Sm, syn, through 


(In — Zy k) Sm, syn 


d\nAf V 
dr] k ’ 


(27) 


where v is the peculiar velocity in Newtonian gauge, and 
we have allowed the bias, b, to be scale dependent, in 
anticipation of the discussion in Section 2.2. 


2.1.4. Magnification and evolution bias 

The amplitudes of the perturbations to the conformal 
time and transverse distances depend explicitly on the 
derivatives of the luminosity function of the source pop¬ 
ulation with respect to luminosity and time (see Eq. 11). 
It has become common to express these derivatives in 
terms of the so-called magnification bias, s{r]), and evo¬ 
lution bias, /evo(?7)) defined as 


s{v) = 


fevoiv) — 


5 ns(7?, InZeut) 
2M(r],> InLcut)’ 
dln[a^ff{r], > InZ 

cut)] 

9 In a 


(28) 

(29) 


As with the clustering bias, the values of s and /evo de¬ 
pend on the source population under study, so must be 
modelled correctly in order to maximise the information 
that can be extracted from any clustering analysis. While 
b{z) must be determined directly from clustering statis¬ 
tics, it is possible to estimate s and /evo directly from the 
overall number counts of sources as a function of redshift 
and magnitude. Let N{z, < m*) be the cumulative num¬ 
ber of sources with magnitude m brighter than m*, per 
unit solid angle and redshift interval. N is related to the 
luminosity function, fis, through 


CY^(z) 

N{z,< m^) = , ns{r],\nL)dlnL, 

{i + zyH{z) 

(30) 

where the threshold luminosity L* is L* = 47r(l -|- 
z)‘^X^{^)P*t and fluxes and magnitudes are related 
through 



F 


(31) 



Fig. 1.— Amplitude of the different terms listed in Eqs. (39-44) 
relative to the amplitude of in the power spectrum. The C^s 
were calculated for a redshift bin at 2 = 2 with width Az = 0.12 
and for constant bias functions ( 6 ( 2 ) = 1.5, 5 ( 2 ) = 1, /evo(-z) = !)• 
We have also included in green the contribution of primordial non- 
Gaussianity for /nl = 1- 


Note that we have neglected evolution and ^-corrections. 

Using the definitions of /evo and s in Eqs. (28) and 
(29), these quantities can be related to the derivatives of 
N with respect to z and m* by 


d^ogipN ^ ^ 

9m* 

d\og,oN (2-5s) H' 

'91ogio(l + ^) XaH ® aH"^ 


(32) 

(33) 


Note that, in order to use these relations to estimate s 
and /evo, it is necessary to have full redshift information 
about the source distribution. While this is available by 
default for spectroscopic surveys, determining the red¬ 
shift distribution becomes more involved for photomet¬ 
ric and radio continuum surveys. This is nevertheless a 
necessary task if these probes are to be usable for cosmo¬ 
logical studies, where (e.g.) the redshift and photometric 
redshift distributions must be correctly modelled. In any 
case, the uncertainties on s and /evo will tend to grow to¬ 
wards large z, and must therefore be taken into account 
in any cosmological analysis. 

As we described in Section 2.1.2, the case of intensity 
mapping is slightly different. In this case, perturbations 
to the angular distance cancel, which is equivalent to 
setting the magnification bias to the critical value sim = 
2/5. /evo can be determined directly from the redshift 
dependence of the background brightness temperature, 
T{z) (Eq. 26). 


2.1.5. Power spectra 

The most informative observable regarding the cluster¬ 
ing of astrophysical sources is their two-point correlation, 
(Av(^i, ni)Av(^ 2 , n 2 )). The perturbation Av can be 
expressed in terms of spherical harmonic coefficients, 

aim{z) = 


/ 


dh A_^f{z,h)Ye 

m (ll); 


(34) 
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where Yimin) are the spherical harmonics. The cluster¬ 
ing of number counts can then be studied through the 
angular power spectrum, defined by the correlation 

{aem{zi)4,^,{z2)) = 5u ' ^mm' Q(zi,Z2), (35) 

where angle brackets denote an ensemble average. 

In practice, A(z, h) is not measured in infinitesimal 
intervals of z, but by averaging over a set of finite radial 
bins, which we will label here by a Latin index, i. The 
observed anisotropy in bin i is 

(36) 

where the window function Wi is normalized to 1 when 
integrated over redshift. The shape of Wi is determined 
both by the background redshift distribution of observed 
sources, fV(z), and the probability that a source at red¬ 


shift z will be included in the i-th bin Pi{z), so that 

Wi{z) (X N{z)pi{z). (37) 

Using this, one can show that the cross-spectrum be¬ 
tween two bins can be written as (Di Dio et al. 2013) 

r°° rlk 

Cy=47r ^Vik)Alik)A^,{k), (38) 
Jo k 

where V{k) is the dimensionless primordial power spec¬ 
trum, which is assumed to take the form V{k) = 
As (fc/fco)"””^, and A\{k) contains the transfer functions 
of the terms that contribute to the anisotropy in bin i, in 
Fourier space and projected on the sky. Expanding the 
various contributions to Eq. (11), the functions A\{k) 
can be written as a sum of 10 terms corresponding to 
different physical effects (Di Dio et al. 2013); 


A°’*(/c) = / dpbWi SM,syn{k, rj) ji{kx{v)), 


A 


RSD,i 


(fc) = dp{aH) ^Wi{r])9{k,r])f^{kx{v)), 


(39) 


A^'\k) = i{e + 1) I dr]Wl^{r]){(j) + 'ijj)(k,r))ji{kx{v)), A/"’*(fc) = / dp - J>) aHW,{p) 


VI,i/ 


e{k,p) . 




je{kxiv)), 


dp[l + 


A^^’\k) = dp [2 + 


H' 


2-5s 


aH'^ X ciH 


+ 5s - /e- 


Wi{p) feikxiv)), 


H' 


+ 


2-5s 


aH'^ X clH 


+ 5s-/evo w^{p) 'ipik,p) ji{kx{p)), 


A^^’\k) = J dp{5s-2)W,{p)(t){k,p)ji{kx{p)), A^^’\k) = j dp{aH) '^W,{p) (j)'{k,p) ji{kx{p)), 


A^^’\k)= I dpW[‘^{p){(j) + ^){k,p)je{kx{p)), Af'^’*(fc) = I dpWP'^{p){(l) + 'iljy{k,p)je{kx{p)), 


lSW,i/ 


where we have defined the window functions 

-1 


lU.(r 7 (z)) = lU.(z) ( g 


Wtip)= / dp'Wiip' 


/3 2 - 5s{p') xiv) - xiv') 
2 x{v)x{r]') 


W^ip) = / dp'Wyp') 


, 2-5s 


X 


lUfW(^)= / dp'Wyp')(l+^ + 


H' 


2-5s 


aH"^ X aH 


+ 5s - fe¬ 


rn 

(41) 

(42) 

(43) 

(44) 

(45) 

(46) 


In these equations, the quantities 5M,syn{k,p), 9{k,p), 
ip{k,p), and (j){k,p) are the transfer functions for the 
synchronous comoving gauge matter density perturba¬ 
tion, the divergence of the peculiar velocity, and the two 
metric potentials respectively. 

Each term is sourced by a different physical effect. A^ 
corresponds to the intrinsic perturbation in the comov¬ 
ing number density of sources, which is the dominant 
contribution in most cases, and is the only term that has 
traditionally been taken into account when RSDs and 
lensing can be neglected. A|^®° is the usual RSD term 
corresponding to the Kaiser effect, due to the deforma¬ 
tion of the Lagrangian volume in redshift space. A^ is 
the lensing convergence term, caused by the deformation 
of the Lagrangian volume in the transverse directions due 
to weak lensing. The terms A^^ and A^^ are extra RSD 
contributions that come from evaluating the background 


terms at a redshift perturbed by the Doppler effect. The 
remaining terms correspond to to the same effect, but for 
redshift perturbations caused by gravitational redshift- 
ing instead of peculiar velocities. In particular, A®^ is 
the analogue of the integrated Sachs-Wolfe (ISW) effect 
(Sachs & Wolfe 1967) for number counts. 

Of these terms, the first three give the largest contribu¬ 
tion to the total clustering anisotropy, so only these have 
traditionally been included in clustering analyses. The 
remaining terms are mainly relevant on super-horizon 
scales at the position of the sources, and even on those 
scales their amplitude is significantly smaller than the 
first three (see Fig. 1). Nevertheless, these terms con¬ 
tain useful information that could potentially be used, for 
example, to constrain different theories of gravity (Lom- 
briser et al. 2013; Baker & Bull 2015). One of the aims of 
this paper is to forecast the detectability of these terms 
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by future experiments. In order to do so, we have de¬ 
fined an effective parameter, ecR, which multiplies the 
terms A^^^^andA^^^ and has a fiducial value 

of 1. ecR therefore parametrizes the amplitude of the 
relativistic corrections to the clustering of sources. 

Even though the origin of the lensing convergence term 
(A^) is clearly General-Relativistic, we have not included 
it under the umbrella of cgr for two main reasons: first, 
we would like to focus on yet-undetected effects, and 
lensing magnification has already been detected by cross- 
correlating pairs of distant tracers (Scranton et al. 2005; 
Hildebrandt et al. 2009). Secondly, in this work we aim 
to identify possibly-detectable observables on ultra-large 
scales, but the lensing term has a non-negligible effect 
on small angular scales. Nevertheless, for completeness 
we have also forecasted for the detectability of lensing 
magnification by defining an effective amplitude, ewL, 
multiplying A^. In keeping with the main aim of this 
paper, we will only produce forecasts for this parameter 
based on its effects on the largest angular scales (lowest 
multipoles), however. 

Thus, to clarify the terminology used here, we will refer 
to the terms and as “GR effects” or 

“GR terms” and to A'^ as the “lensing term”, even if the 
nature of the latter is clearly relativistic. 

2.2. Primordial non-Gaussianity 

A fundamental assumption of current theories of large- 
scale structure is that the primordial seed fluctuations 
can be described as a multivariate Gaussian random 
process, uniquely characterised by the primordial power 
spectrum (Baumann 2009). For many years, the Gaus- 
sianity of primordial fluctuations was one of the main 
predictions of inflationary theories of the early Universe. 
More recently, the possibility of non-Gaussian primor¬ 
dial fluctuations has been revisited for two main rea¬ 
sons. On the one hand, a battery of statistical tech¬ 
niques have been developed to quantify primordial non- 
Gaussianity, primarily from the CMB, but also adapted 
to large-scale structure data (Yadav & Wandelt 2008; 
Fergusson & Shellard 2009; Planck Collaboration 2014c). 
There is a hope that these techniques will uncover some¬ 
thing that will enrich our understanding of the early Uni¬ 
verse, above simple one- and two-point statistics. On 
the other hand, the Effective Field Theory approach to 
inflation (Cheung et al. 2008; Weinberg 2008) can be 
used to systematically quantify all possible deviations 
from the quadratic action of linear perturbation theory 
around quasi-de Sitter space. These deviations, in the 
form of higher-order terms, will lead to non-trivial Gaus¬ 
sian signatures that are directly related to the fundamen¬ 
tal parameters (and more importantly, the fundamental 
structure) of the theory of the early Universe. 

A useful (though not universal) way of describing devi¬ 
ations from primordial Gaussianity (Komatsu & Spergel 
2001) is to assume that the gravitational potential $ can 
be decomposed into a quadratic polynomial in a Gaus¬ 
sian random field (f>, of the form 

<i> = </' + /NL(</>" -(</>")), 


where /nl is, in the simplest scenarios, a constant, /nl 
has been calculated for a family of inflationary models. 
In the local (‘squeezed’) configuration, it is expected to 
be of order \ns — 1| ^ 0(10“^), where Us is the spectral 
scalar index of primordial fluctuations (although some 
non-canonical models can lead to /nl ~ 1; Verde et al. 
2000; Liguori et al. 2006; Smith & Zaldarriaga 2011). 
Current constraints from higher order statistics of CMB 
maps place the constraint at |/nl| ^ 7 (Planck Collabo¬ 
ration 2015b). 

A novel proposal is to look for the effects of non- 
Gaussian initial fluctuations at the level of the power 
spectrum. It has been shown that primordial non- 
Gaussianity induces a scale- and redshift-dependence for 
any biased tracer, X, of the overall density field (Matar- 
rese & Verde 2008; Dalai et al. 2008). If the density 
contrast of X has a linear, Gaussian bias, 6^, then the 
non-Gaussianity of the distribution will induce a correc¬ 
tion of the form 


Abx{z,k) = 3/nl 


[6g(z) - i]nMH^s 
{T{k)D{z)k^) 


C 


(47) 


Here, VLm = Uf, -|- Uq is the fractional energy density in 
matter (i.e. baryons plus dark matter), Tfg is the Hubble 
constant, 5c — 1.686 is the critical density contrast of 
matter from the spherical collapse model, T{k) is the 
matter transfer function, and D(z) is the linear growth 
function for density perturbations. 

As can be seen from Eq. (47), there will be a substan¬ 
tial enhancement of the tracer power spectrum on large 
scales, with a ~ 1/fc^ scale dependence (since T ^ 1 on 
large scales). A rough estimate is that the transition 
scale is of order k^a /nl-Hq) i-e. we expect the scale- 
dependent signature to kick in close to the horizon scale 
for /nl 1- There have already been some attempts 
to constrain /nl from the scale-dependent galaxy bias, 
although they have been severely hampered by system¬ 
atic effects on extremely large scales (Giannantonio et al. 
2014). In parallel, there have also been attempts to fore¬ 
cast the possibility of measuring /nl ~ 1 with future 
surveys (e.g. Namikawa et al. 2011; Giannantonio et al. 
2012; Camera et al. 2013; Ferramacho et al. 2014; Cam¬ 
era et al. 2015b; Raccanelli et al. 2015). 

Given the nature of this signature - the fact that it 
arises on large scales and has a 1/A:^ scale dependence 
- it has been argued that non-Gaussianity may be de¬ 
generate with the relativistic effects we are studying in 
this paper (Bruni et al. 2012; Jeong et al. 2012; Bertacca 
et al. 2012). We will therefore include the effect of /nl 
throughout our analysis and, in the process, present the 
most up-to-date and conservative forecasts for its de¬ 
tectability with future surveys. As illustrated in Fig. 
1, the extra power induced by a value of /nl 1 on 
large scales is typically similar to the amplitude of the 
relativistic corrections presented in the previous section. 

Including our three main observables (/nlj ^gR) and 
cwl) the total perturbation to the number counts is: 



(48) 


+ ewL + ecR 


RSD,i 


+ A 


where there is an implied scale- and time-dependence in 
all of these terms. 


3. FORECASTING FORMALISM 

The spherical harmonic coefficients of the fluctuation 
in the observed number counts in the i-th redshift bin, 
contain most of the information about the clustering 
of sources. Assuming that they are Gaussian-distributed 
with a variance given by the cross-power spectra 
(Eq. 38), it is straightforward to show that the log- 
likelihood for a given realization of the harmonic coef¬ 
ficients is 


ln/: = -- 


'^aim - In (det[Q]) 


■ const., 


(49) 


where we have written for each £ and to as a vector 


= Ofm’ ^ad the set of cross-spectra as a matrix 

[Cf]ij = Cy. Our aim here is to forecast the precision 
with which different experiments will be able to measure 
a certain set of parameters, {9a}, which boils down to 
predicting the parameter covariance matrix. An efficient 
way of doing this is to use the Fisher matrix formalism, 
wherein the likelihood is approximated by a Gaussian 
expansion of Eq. (49) around a hducial set of parameters. 


ln£ = Y,{0a - ea)Fap{ep - 9^) + 0{9^), (50) 


where we have defined the Fisher matrix Fap = 
{d'^\nC/d9ad9p). The covariance matrix Cap = {{9a — 
9a){9p — 9p)) can then be approximated as the inverse of 
Fap- For our likelihood, one can show that 

FaP = /sky Tr [{daU) {dpCi) C,"!] , 

(51) 

where da = d/d9a- 

We model the observable aim as the sum of two contri¬ 
butions: aim = afm + Corresponding to signal (i.e. 
cosmological anisotropies) and noise (non-cosmological 
fluctuations due to instrumental or shot noise). We will 
also assume that both contributions are uncorrelated, so 
that Cf = Cf -I- N^. Here, Cf is given by Eq. (38), and 
is the noise power spectrum, the exact form of which 
will depend on the particular type of experiment. 

The theoretical power spectra Cf were computed us¬ 
ing a modihed version of the public CLASS code (Les- 
gourgues 2011; Di Dio et al. 2013). Our modifica¬ 
tions are documented in detail in Appendix A. For our 
fiducial cosmology, we adopted a model consistent with 
the best-fit flat AGDM parameters from Planck (Planck 
Gollaboration 2014b), given by {VLm, fb,h,w,As,ns) = 
(0.315,0.156,0.67,-1,2.46 x 10"®, 0.96), where fb = 
ftb/^M is the baryon fraction. We further set the fiducial 


Y2,i 


+ A 


pi.j 


. P2,i 


. P3,2 


■A 


P4,i 


■A 


ISW,1 


value of /nl to 0, the value for Gaussian initial condi¬ 
tions, and cgr = cwl = 1- 

While we are primarily interested in forecasting for /nl 
and cgRj we must also marginalize over other parameters 
that could be correlated with them, which includes the 
six other cosmological parameters listed above, as well 
as the bias nuisance parameters described below. When 
forecasting for the uncertainty on /nl, we do not consider 
CGR as an extra free parameter, and fix it to its fiducial 
value of 1. Gonversely, for the cgr forecasts we fix all but 
/nl and cgr, assuming that only a possible degeneracy 
with /nl could hamper a detection of the GR effects (all 
other parameters would simply change the shape of the 
GR correction ‘template’). 

The derivatives required by Eq. (51) were computed 
using central finite differences, 


q n_fi^a+S9a) f{9a S9a) (n(S:f)3\ 

where we chose intervals 69a such that the estimated 
derivatives converged to the required numerical accuracy. 

We also need to impose priors on certain parameters 
(e.g. the bias functions) to mitigate degeneracies. These 
are straightforward to incorporate into the Fisher matrix 
formalism: a Gaussian prior on {9a} with a covariance 
Clp can be added directly to the Fisher matrix as 


F, 


0.(3 


Fo.p + {C^)ll. 


(52) 


For the cosmological parameters in particular, it is use¬ 
ful to add a Planck GMB prior, which we construct by 
estimating their covariance matrix directly from the cor¬ 
responding Planck MGMG chains (Planck Gollaboration 
2014a). 

Finally, the constraints that any experiment will be 
able to yield will depend crucially upon the smallest and 
largest scales that can be used. For angular scales, this is 
explicitly taken into account in Eq. (51) as the maximum 
multipole, ^max, that we sum up to. This cutoff is deter¬ 
mined by either the angular resolution of the experiment 
(e.g. the beam size for intensity mapping), or by the non¬ 
linear scale, beyond which the theoretical predictions be¬ 
come unreliable and the modes must be discarded from 
the analysis. The smallest radial scale corresponds to the 
comoving width of the redshift bins used, and is also de¬ 
termined by either instrumental effects (e.g. the redshift 
resolution in continuum and photometric surveys) or the 
non-linear scale. In any case, since the effects we aim to 
study in this work are dominant on large scales, most of 
the information about them is concentrated on scales well 
inside the linear regime, and our final results are fairly 
insensitive to the choice of a minimum scale. By default 
we assume fmax = 500 for all of the probes considered 
here, and address the redshift binning for each case indi¬ 
vidually. In terms of the maximum angular scales, this 
is set by the sky area surveyed by each telescope. In our 
analysis, all surveys can probe down to £ = 2. In later 
sections we analyse the effect of changing this minimum 
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Fig. 2.— Linear clustering bias (top panel), evolution bias (mid¬ 
dle panel) and magnification bias (bottom panel) for the different 
experiments considered here. 

1. The largest radial scale is set by the available redshift 
range, or the maximum scale that is free from foreground 
contamination in the case of intensity mapping. 

As described in Section 2.1, the amplitude of the 
number count spectra also depends on three redshift- 
dependent parameters: the clustering bias, b{z), the 
magnification bias, s{z), and the evolution bias, fevo{z)- 
We will refer to these three parameters collectively as 
the “bias functions” in what follows. The value of b{z) 
can only be determined from the clustering statistics of 
the sample under study, and must therefore be marginal¬ 
ized over in the cosmological analysis. As discussed 
previously, 5 ( 2 :) and fevo{z) can be estimated from the 
redshift-magnitude distribution of the sources, although 
these estimates will inevitably be uncertain and, to some 
extent, model-dependent. Properly accounting for this 
uncertainty is vital, as the behaviour of the bias func¬ 
tions can strongly affect the detectability of the signal. 
In the absence of strong prior measurements from obser¬ 
vations or simulations, these parameters must therefore 
also be marginalized over. 

In order to do this, we defined a small number of red- 
shift bins that sample the bias functions well enough for 
each survey. The mean values of the bias functions were 
computed in each bin, and a linear interpolation between 
those values was used to dehne each fiducial bias func¬ 
tion. We then marginalized over the functions by treat¬ 



Fig. 3.— Illustration of the procedure used in this work to 
marginalize over the bias functions in the case of the HI cluster¬ 
ing bias: first the theoretical function is calculated (solid red line). 
Then, the mean values of the function in the redshift bins listed 
in Table 1 are computed (black circles), and the linear interpola¬ 
tion between these values (black dashed line) is used as the fiducial 
function in the computation of the power spectra. Finally, in order 
to marginalize with respect to this bias function, the mean values 
mentioned above are used as free parameters, and varied to com¬ 
pute the numerical derivatives in Eq. (51) (blue circles and dotted 
blue lines). 

ing each of the mean values as an additional free param¬ 
eter that was included in the computation of the Fisher 
matrix (see Figure 3). We also explored other strategies, 
such as higher order interpolation and local modifications 
to the fiducial functions in each bin, but the method 
described above was found to be the best compromise 
between simplicity and stability to variations in each pa¬ 
rameter. We also confirmed that the final results do not 
change significantly for the different methods. The red¬ 
shift bins used for the bias functions for each of the four 
survey types are given in Table 1, and the input bias 
functions for the surveys^ are shown in Figure 2. 

4. FORECASTS 

The aim of this section is to present the forecasted 
uncertainties on our main observables (/nLj cwl and 
egr) for the main four types of cosmological surveys 
that will be used to measure them in the future: in¬ 
tensity mapping, radio continuum surveys, spectroscopic 
redshift surveys and photometric redshift surveys. For 


Survey 

bins 

Bin edges 

Int. map. (SKAl-MID) 

5 

0.0, 

0.7, 1.3, 

2.0, 3.0, 4.0 

Cont. survey (SKA) 

5 

0.0, 

0.5, 1.0, 

2.0, 3.0, 5.0 

Spectre. (Hck survey) 

4 

0.5, 

0.75, 1.0, 1.3, 2.1 

Photo. (LSST), red 

4 

0.0, 

0.35, O." 

’, 1.05, 1.4 

Photo. (LSST), all 

5 

0.0, 

0.5, 1.0, 

1.5, 2.2, 3.0 


TABLE 1 

Redshift bins used for the bias function nuisance 

PARAMETERS, FOR EACH SURVEY. 


^ The codes used to estimate the bias functions for the models de¬ 
tailed in Sects. 4.1—4.4 can be found at http: //intensitymapping, 
physics.ox.ac.uk/codes.html. 
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Experiment type 

Experiment 

[zo,z/] 

{S/N) 

{b) 

(^> 

(/evo) 

Intensity mapping 

SKAl-MID 

[0.1,3.5] 

6.7 

1.2 

0.4 

-0.4 

Continuum survey 

Scut — 10/iJy 

[0,3] 

8.3 

1.9 

0.4 

-0.6 


‘S'cut — 5/j.Jy 

[0,4] 

13 

2.5 

0.3 

-0.3 


'S'cut — 1/^Jy 

[0,5] 

32 

2.9 

0.3 

-0.2 

Spectroscopic survey 

Hck survey 

[0.65,2.05] 

3.6 

1.5 

0.55 

-2.0 

Photometric survey 

LSST-red 

[0,1.4] 

25 

1.75 

0.04 

3.5 


LSST-full 

[0,2.5] 

210 

2.1 

0.35 

-0.7 


TABLE 2 

Properties of the experiments under consideration. The quantities shown in columns 2-6 are, in order: the approximate 

REDSHIET RANGE PROBED BY EACH EXPERIMENT, THE AVERAGE SIGNAL-TO-NOISE RATIO (COMPUTED BY AVERAGING THE RATIO OE THE 
SIGNAL AND NOISE POWER SPECTRA OVER REDSHIET AND ANGULAR SCALE), AND THE CLUSTERING, MAGNIFICATION AND EVOLUTION BIASES 

AVERAGED OVER REDSHIET. 


each of them we will also discuss the main sources of sys¬ 
tematic uncertainties that could prevent these measure¬ 
ments. The signal and noise models assumed for each of 
these experiments are presented in detail in Appendix B. 

4.1. HI intensity mapping 

Intensity mapping (IM) is a relatively new technique, 
but has a number of potential advantages for the study 
of ultra-large scales (Battye et al. 2004; Wyithe & Loeb 
2008; Chang et al. 2008). The idea is to observe the 
unresolved emission integrated over many line-emitting 
sources that are assumed to trace the large-scale mat¬ 
ter distribution, sacrificing angular resolution for survey 
speed. For source populations with sufficiently narrow, 
isolated emission lines, high redshift resolution can never¬ 
theless be obtained with a suitable spectroscopic instru¬ 
ment, allowing the 3D redshift-space matter distribution 
to be reconstructed. Thus, for a given pointing on the 
sky, one measures the combined emission from all the 
sources in it essentially as a continuous field, much like 
the CMB, rather than as a set of separately-identifiable 
objects. There is therefore no need to set a flux thresh¬ 
old that rejects most of the peaks in the signal for being 
insufficiently far above the noise level; the whole of the 
signal can be used, but different modes will be recovered 
with more or less noise. This leads to significant gains 
in survey speed, making it possible to cover extremely 
large volumes - and thus constrain ultra-large scales - in 
a relatively short time. 

The most developed technique to date is HI intensity 
mapping, which uses the redshifted 21cm radio emission 
line of neutral hydrogen (HI) as its tracer. HI is ubiq¬ 
uitous even in the post-reionisation Universe, where it is 
mostly confined to dense, self-shielded Damped Lyman- 
a systems. Forthcoming large, high-sensitivity, wide- 
bandwidth radio arrays such as the Square Kilometre 
Array (SKA), and purpose-built IM experiments like 
CHIME, are expected to be able to detect fluctuations 
in the cosmological HI signal over ^ 75% of the sky, for 
a wide range of redshifts (Bull et al. 2015). We have fo¬ 
cused our analysis on Phase I of the SKA, which has the 
large survey area and extremely wide frequency/redshift 
coverage needed to access ultra-large scales (although we 
have also computed the constraints for a cosmic variance- 
limited experiment). 

The SKA is a proposed and partially-funded multi¬ 
science radio facility that will be able to survey a large 


fraction of the sky in the frequency range from ^ 50 MHz 
to ^ 20 GHz. It will comprise two different instruments, 
built at separate sites in the South African Karoo region 
and Western Australia’s Murchinson region: 

• SKAl-MID: an array of ~ 200 single-pixel, 15m 
dishes to be installed in South Africa. It will cover 
the frequency range 350-1760 MHz (z < 3) in two 
separate bands. 

• SKAl-LOW: a set of about 455 aperture array sta¬ 
tions each with 35m diameter. It will cover the 
frequency range 50-350 MHz (3 < z < 20). 

Further information regarding the baseline design for the 
SKA can be found in Dewdney et al. (2009) and Braun 
et al. (2015). 

As discussed in Bull et al. (2015) and Santos et al. 
(2015), the most efficient use of SKAl for intensity map¬ 
ping at late times (z < 3) would be to use SKAl-MID 
as an auto-correlation experiment. If used in interfero¬ 
metric mode, the number of short baselines - needed to 
resolve the BAO scale and larger - is not large enough. 
This fact is all the more important for ultra-large scales, 
and so we have focused on the SKAl-MID, single-dish 
case. 

4.1.1. Constraints on relativistic effects 

As we have argued above, the two main sources of 
extra power on ultra-large scales are primordial non- 
Gaussianity and relativistic corrections. In order to 
study the detectability of the latter, we can therefore 
treat only /nl and cgr as free parameters, and fix the 
rest to their fiducial values. While this procedure would 
clearly yield an optimistic prediction of the actual con¬ 
straint on CGRj it mimics what a survey attempting a 
first detection of any new effect would do: fix all non¬ 
degenerate parameters to their best-fit values, and fit for 
the amplitude of the terms related to the new effect. If, 
in doing this, the SNR on the amplitude of the effect is 
smaller than unity, then there is no point in even consid¬ 
ering the covariance with other parameters. 

We applied this procedure for both SKAl-MID and 
a cosmic variance-limited survey (/sky = 1, = 0), 

obtaining the following result: 

SKAl-MID —^ cr(eGR) = 2.75, (53) 

GV-limited —> cr(eGR) = 1-97. (54) 
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Fig. 4.— Dimensionless angular power spectrum for a HI in¬ 
tensity map at redshift 2 : ~ 3 with width A 2 ~ 0.5 (orange) and 
dimensionless TT angular power spectrum of the CMB (blue). The 
much larger amplitude of perturbations at lower redshifts for in¬ 
tensity mapping can explain the difficulty in detecting the effect of 
relativistic corrections in LSS. 

Thus, even in the best-case scenario, it is not possible 
to detect the effect of the relativistic corrections on the 
brightness temperature fluctuations. 

This may seem like a striking result, as one of the rel¬ 
ativistic effects is the equivalent of the CMB ISW ef¬ 
fect for large-scale structure in Eq. 44), and the 

CMB ISW has already been detected above 4cr by cross- 
correlating CMB maps with different LSS datasets (Gi- 
annantonio et al. 2008). In simplistic terms, intensity 
mapping surveys can be thought of as a set of uncorre¬ 
lated “CMB” maps at different redshifts, so it is legiti¬ 
mate to ask why a similarly significant measurement is 
not possible in this case. 

This can be explained in terms of the clustering vari¬ 
ance of both datasets. Consider an attempt to measure 
the ISW effect by cross-correlating two datasets, one at 
high redshift (e.g. the CMB or a high-z HI intensity 
map), which we label here by a superscript h, and an¬ 
other at low redshift (e.g. a galaxy survey or low-z in¬ 
tensity map), which we label by g. Assuming that the 
ISW is the only term that could give rise to a significant 
cross-correlation between both datasets, the signal would 
be given by the cross-power spectrum, 

^isw = (55) 

Neglecting any instrumental or shot noise, and assum¬ 
ing full-sky coverage and Gaussian statistics, the noise is 
purely given by the sample variance, 

^isw (56) 

where we have assumed that <C Except 

for factors of order unity, the amplitude of the signal 
depends only on the value of (j)' + 'll)' at the redshift of 
g, and not on the nature of the high-redshift sample, so 
it will be roughly the same for both a high -2 intensity 
mapping bin and the CMB. The difference in SNR be¬ 
tween the two cases must therefore depend primarily on 


the amplitude of the noise, which differs by the ratio of 
Cg’' for the two cases. Since perturbations have grown 
significantly since zcmb 1100 , we can expect the in¬ 
tensity mapping power spectrum to have a much larger 
amplitude, (jhh,CMB^ which would explain the 

difficulty of achieving a good SNR in this case. This is 
explicitly shown in Figure 4: even at the highest redshift 
we considered, the intensity mapping power spectrum is 
4 orders of magnitude larger than that of the CMB. 

Another effect conspiring against the detection of the 
relativistic terms in an intensity mapping survey is the 
fact that the perturbations on transverse scales cancel 
exactly (i.e. s{z) = 2/5), as previously discussed. This 
further suppresses the overall amplitude of the relativis¬ 
tic effects, and is the reason why we do not present fore¬ 
casts for ewL in this case. 

4.1.2. Co'nstramts on pri'mordial non-Gaussianity 

The uncertainty on the level of primordial non- 
Gaussianity measured by an intensity mapping exper¬ 
iment will depend on our prior knowledge of the free 
parameters of the model. Here we have imposed a (non¬ 
diagonal) Gaussian prior on the cosmological parameters 
{Hjv/, /h, h, wq, ris^As} using a prior covariance matrix es¬ 
timated from the appropriate Planck 2015 MGMG chains 
for our set of parameters (Planck Gollaboration 2015a). 
As we have argued, it is reasonable to assume that, by 
the time the SKA attempts to measure /nl 7 prior infor¬ 
mation will be available regarding the HI clustering bias 
(e.g. from experiments on smaller scales) and evolution 
bias (e.g. from external measurements of Thi( 2 )). 

Before we assume any specific priors for these parame¬ 
ters, it is worth studying their effect on ^(/nl) in order to 
quantify how good prior measurements will need to be in 
order to optimize the constraints on /nl- We first stud¬ 
ied the level of degeneracy between /nl and b{z) by esti¬ 
mating the value of (t(/nl) assuming a relative Gaussian 
prior on 6 ( 2 ), constant across the whole redshift range. 
The result is shown in the left panel of Fig. 5. We ob¬ 
serve that a mild improvement on (t(/nl) (~ 10 % for 
SKAl-MID and ~ 20% for a CV-limited survey) can be 
achieved only for extremely accurate prior measurements 
of the clustering bias {Ab/b < 10“3). Since it would not 
be realistic to expect such a tremendous accuracy, we 
adopted a fiducial prior on b of 10 %, more compatible 
with current measurements of the bias of neutral hydro¬ 
gen (Masui et al. 2013). 

Using this fiducial prior, we then explored the degen¬ 
eracy between /nl and /evo by studying the dependence 
of o'(/nl) on the prior uncertainty, A/evo- The result 
is shown in the right panel of Fig. 5. While cr(/NL) in¬ 
creases only slightly when factoring in the uncertainties 
on the clustering bias, we observe a much larger increase 
(e.g. by a factor 2.4 for SKAl-MID) when we assume 
no knowledge about the evolution bias of the sample at 
all. This suggests a much stronger degeneracy between 
/nl and /evo, which could affect any attempt at measur¬ 
ing /nl with LSS probes (note that a similar issue was 
reported by Gamera et al. 2015b). 

The source of the degeneracy can be understood 
by comparing the scale dependence of the terms in 
Eqs. (39-44) that are proportional to /evo and /nl 
respectively. Primordial non-Gaussianity introduces a 
term, included in A/', with a fc-dependence of the form 
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Fig. 5.— Left panel: uncertainty in /nl a function of the prior on b(z). A constant Gaussian relative prior was imposed over the 
whole redshift range. Right panel: assuming a prior of 10% on b(z) (e.g., 10% uncertainty). This plot shows the dependence of on 

the prior for fevo- 


oc S(k)j{(kx)/{k^ T{k)) ~ S(k)j{(kx)/k^, where the sec¬ 
ond equality holds on ultra-large scales. The evolution 
bias, on the other hand, multiplies four different terms: 

1 . proportional to 'ip(k)j((kx) oc S(k)ji(kx)/k^. 

2. which is proportional to 0(k)ji(kx)/k'^ oc 

S(k)ji(kx)/k^. 

3. A)^^, proportional to 0(k)j^(fcx)//c. For sufficiently 
large £ (> 5), this term is also proportional to 
S(k)ji(kx)/k^. 

4. the ISW term, which involves a much wider 
window function covering the full photon path from 
the source. 

Thus, on large scales, three out of the four terms involv¬ 
ing /evo have the same scale dependence as the /nl term, 
which explains the result found above. Fortunately, as 
can be seen in Fig. 5, the effect of this degeneracy disap¬ 
pears if we can assume a relatively loose prior on /evo of 
A/evo ^ 1- I-O. if we parametrize the evolution of back¬ 
ground density of HI in the Universe as phi (a) cx o“, the 
slope a should then be measured with an error Aa < 1. 
In what follows we assume that such an accuracy will 
be available from external measurements of Thi(z), and 
impose a Gaussian prior of A/gvo = 1- 
For our fiducial set of priors (Planck CMB priors for 
the cosmological parameters, Ab/b — 0.1, and A/gvo = 
1), the final constraints on /nl for SKAl-MID and for 
a cosmic variance-limited survey are given in Table 3. 
We also include results for an IM survey with a higher 
fiducial bias, which we discuss in Section 5. 

4.1.3. Systematic uncertainties 

One of the most important observational challenges 
for intensity mapping is the presence of galactic and ex- 
tragalactic radio foregrounds (e.g. galactic synchrotron 
emission and extragalactic continuum radio sources) with 
amplitudes several orders of magnitude larger than the 



Fig. 6. — Constraint on /nl ^ ^ function of the maximum co¬ 
moving separation between pairs of redshift bins included in the 
analysis (at z ~ 1.7, 1 Gpc corresponds to ~ 100 MHz). 

cosmological HI signal. The potential bias and extra vari¬ 
ance induced in the measured signal by the process of 
foreground removal must be correctly taken into account 
in any analysis. 

The strategy underlying most foreground cleaning 
methods is to exploit the very different frequency struc¬ 
tures of signal and foregrounds. Most foreground sig¬ 
nals have a very smooth frequency dependence, while 
the cosmological signal traces the stochastic fluctuations 
in the matter density, and is therefore much “noisier” 
in the radial (frequency) direction. Broadly speaking, 
most cleaning methods try to remove the foregrounds by 
fitting and subtracting a set of smooth functions of the 
frequency from the combined foreground -I- cosmological 
signal. See Alonso et al. (2015) for a description and 
comparison of different methods. 

Since foregrounds are smooth in frequency, and fre- 
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Fig. 7. — Uncertainty on /nl as a function of the largest angular 
scale (minimum i) included in the Fisher matrix calculation. Sys¬ 
tematic effects and incomplete sky coverage can prevent recovery 
of the lowest multipoles (in this plot, sky coverage, /skyi is fixed 
to the survey specifications). 

quency is a proxy for radial distances for an IM experi¬ 
ment, we can expect the foreground-cleaned maps to be 
dominated by systematics on large radial scales. These 
scales must then be omitted from the analysis, which re¬ 
duces the sensitivity of an experiment to /nl- In order to 
understand the importance of this effect, we have studied 
the dependence of cr(/NL) on the maximum radial sepa¬ 
ration between redshift bins included in the computation 
of the Fisher matrix, i.e. we set to zero the off-diagonal 
elements of corresponding to pairs of bins separated 
by more than some radial separation Ay. 

The results are shown in Figure 6 for SKAl-MID and 
for a cosmic variance-limited survey. Reducing the range 
of the cross-correlations included in the analysis can de¬ 
grade the sensitivity to /nl significantly, enlarging the 
errors by up to a factor ~ 3.3 in the case of SKAl-MID. 

Given that measurements of ultra-large scale modes 
will typically need to be done in single dish (autocorrela¬ 
tion) mode, one also needs to worry about the stability of 
the instrument and the observational strategy. Much as 
in CMB experiments, long term noise drifts (the “1//” 
noise) will lead to striping in the maps, i.e. a coherent 
set of large angle features that have been artificially pro¬ 
jected on the sky. A sensible choice of scan strategy that 
leads to appreciable cross-linking between the scans can 
mitigate the effect, but there will always be a residual 
large-angle contaminant. Again, we can model this ef¬ 
fect by not including the very large angular modes in the 
analysis (tantamount to assuming that they are filtered 
out by the destriping process). Fig. 7 gives some idea of 
the impact of this effect on the constraints: a severe cut 
in the large angle data significantly degrades any attempt 
to detect large-scale features. 

4.2. Radio continuum surveys 

Radio continuum surveys measure the integrated emis¬ 
sion of radio sources in one broad frequency band (or a 
small number of them). At radio wavelengths, the spec¬ 
tral energy distributions (SEDs) of most sources are gen¬ 


erally smooth and featureless, except for a few radio lines 
such as the aforementioned 21cm signal (which can only 
be resolved for individual sources after a long integration 
time). For this reason, there is little distance information 
to be gained by integrating the flux density over more 
than a single, wide band. In turn, using a wide band 
significantly increases sensitivity, allowing much fainter 
sources to be observed than it would be possible to de¬ 
tect otherwise. Continuum surveys can therefore cover 
extremely large volumes, with the caveat that essentially 
all information on radial scales (even the ultra-large ones) 
is completely inaccessible. 

That continuum surveys have the potential to con¬ 
strain the level of primordial non-Gaussianity has already 
been shown in the literature (Ferramacho et al. 2014; 
Raccanelli et al. 2015), especially if the survey can be 
split into several different populations so that the multi¬ 
tracer technique (Seljak 2009) can be used. We attempt 
to reproduce this result here, as well as exploring the 
possible degeneracies of /nl with the relativistic correc¬ 
tions, and the possibility of detecting the latter. As with 
the other probes, we treat the continuum survey as be¬ 
ing single-tracer only, and will not study the potential of 
the multi-tracer technique here. (This is left for future 
work, in which all the possible cross-correlations will be 
considered systematically.) 

Our forecasts focus on a continuum survey with Phase 
1 of the SKA, since this would correspond to the widest 
proposed survey area to date in the ^ /rJy regime. SKAl 
should be able to detect radio sources out to z ~ 5 over 
an area of about 37r steradians. This survey would be 
carried out using the SKAl-MID facility, integrating the 
source flux in the band 350-1050 MHz with an rms noise 
of S'rms — l^Jy (see Jarvis et al. (2015) for details). 

It is common to define the source detection limit 5'cut 
to be several times higher than the noise level of the 
experiment (usually by a factor of 5 or 10, depending 
on the intended use of the sample). Assuming an rms 
instrumental noise for SKAl-MID of S'rms = ImJYj we 
have defined a fiducial 5 (t detection limit (i.e. S'cut = 
5/rJy). In order to explore the dependence on the survey 
depth, we have also produced forecasts for S'cut = lO^Jy 
and Scut = lliJy- 

4.2.1. Constraints on relativistic effects 

Relativistic effects in the continuum angular power 
spectrum were considered in Maartens et al. (2013) and 
Chen & Schwarz (2015). As in the previous section, 
we first present the most optimistic forecasts for cgr by 
marginalizing only over /nl- If the uncertainty on cgr 
is larger than unity (i.e. no detection of GR effects) in 
this optimistic case, there is no point in exploring more 
realistic scenarios. 

There are two main differences between intensity map¬ 
ping and continuum surveys in terms of the quantities 
that can affect these forecasts. First of all, we can ex¬ 
pect the lack of radial information in continuum surveys 
to considerably degrade the constraints on most param¬ 
eters. Perturbations to transverse scales will affect the 
observed clustering of radio sources in this case, however 
(i.e. s{z) 7 ^ 2/5, unlike for intensity mapping), which 
will enhance the amplitude of the relativistic terms. 

The constraints on cgr found for the three flux limits 
that we considered are shown in Table 3. Due to the 
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and the magnification and evolution biases (bottom left and right panels, shown in terms of an absolute prior). 


lack of radial information, there is no hope of detecting 
GR effects using only a single-tracer analysis, in spite of 
the enormous volume probed; cr(eGR) = 17.1 even for the 
deepest (1/rJy) survey. 

We have also produced forecasts for the detectability of 
the weak lensing term on large scales for continuum sur¬ 
veys, following the same logic used in the case of cgr (be. 
we keep all other parameters fixed, except for /nl)- In 
order to pick up only the large-scale lensing contribution 
we also used a more stringent value of £max = 100. The 
results are summarized in Table 3: a continuum survey 
with a flux limit of l^Jy would be able to clearly detect 
the large-scale lensing effect above ~ 4a, although the 
level of this detection would be below 2cr for S'cut = 5/rJy. 
No detection would be possible for a flux cut of 10/rJy. 

4.2.2. Constraints on primordial non-Gaussianity 

In a continuum survey, the available information is 
compressed into only a small amount of data - the an¬ 
gular clustering statistics of radio sources - due to the 
lack of any sensitivity to radial modes. We can therefore 
expect an even larger degree of degeneracy between the 
various cosmological and nuisance parameters than for 


intensity mapping. As before, we assume Planck CMB 
priors on all cosmological parameters (except /nl), and 
start our discussion of the primordial non-Gaussianity 
forecasts by exploring the effect of prior information 
about the bias functions on the /nl constraints. The 
results of this analysis are shown in Figure 8. 

The top left panel of Fig. 8 shows the forecast uncer¬ 
tainty on /nl as a function of a constant relative Gaus¬ 
sian prior for b. In contrast with the situation for inten¬ 
sity mapping, the constraints are much more sensitive to 
the prior on the clustering bias. The main reason for 
this is that, for IM experiments, the availability of red- 
shift information helps to break the degeneracy between 
the parameters through the scale dependence of the /nl 
term along the line of sight. We find that a ~ 10% error 
on b{z) would be sufficient to minimize the uncertainty 
OR /nl- Since this is compatible with previous smaller- 
scale observations (c.f. Lindsay et al. 2014), we chose this 
value as our fiducial prior on the clustering bias. 

The effect of a prior on the magnification bias is shown 
in the bottom left panel of Fig. 8. We observe a simi¬ 
lar degeneracy with /nl, again mainly due to the lack 
of redshift information, which can only be mitigated by 
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Fig. 9.— Dependence of the uncertainty on /nl (solid lines) and 
^GR (dashed lines) on the minimum scale probed by the survey for 
different flux limits. 


prior measurements of s(z) with an error better than 
As « ±0.1. We have assumed that such an accuracy 
would be achievable using the magnitude-redshift distri¬ 
bution of sources, although this could be an optimistic 
assumption at the highest redshifts. Finally, as in the 
case of intensity mapping, we observe a significant degra¬ 
dation in the uncertainty on /nl if we assume no knowl¬ 
edge about the evolution bias of the sample (bottom right 
panel in Fig. 8). This is again due to the degeneracy in 
the scale dependence of the terms corresponding to both 
quantities (see Sect. 4.1.2), and can be mitigated by mea¬ 
suring /evo to an accuracy of better than A/evo ^ 10. 

In view of this analysis, our final fiducial set of pri¬ 
ors on the bias functions is Ab/b = 0.1, As = 0.1, and 
A/evo = 1- Forecasts for /nl with these priors are listed 
in Table 3 and, with (t(/nl) = 16 for the 5/rJy sam¬ 
ple, are compatible with the results of Ferramacho et al. 
(2014) for their combined sample. This is far worse than 
the constraints possible with multiple tracers, however. 

4.2.3. Systematic uncertainties 

The different frequency range and observational tech¬ 
niques involved in radio astronomy give rise to poten¬ 
tially very different sources of systematics for contin¬ 
uum surveys compared with optical and near-infrared 
surveys. To begin with, the diffuse nature of galactic 
synchrotron emission (the largest galactic foreground at 
radio frequencies) makes it virtually transparent to the 
long interferometer baselines needed for a survey aiming 
to resolve individual sources, and hence the problem of 
galactic foregrounds is greatly ameliorated. On the other 
hand, in order to produce a full-sky catalogue, mosaic- 
ing of the individual pointings must be implemented. If 
the mosaicing pattern and correlations in the noise prop¬ 
erties between pointings are not fully understood, they 
could introduce systematic deviations on large angular 
scales. Ionospheric effects will also be relevant at low 
frequencies, although this should not be a problem in 
the SKAl-MID frequency range. Bright point sources 
would also need to be masked in a non-trivial way, due 
to dynamical range issues causing increased noise in the 


far beam sidelobes. The extent to which this would af¬ 
fect ultra-large scales is instrument-dependent, however. 
Avoiding these systematics might again entail removing 
the smallest multipoles of the power spectrum from the 
analysis; we show how the constraints on /nl depend on 
the minimum multipole .^min in Fig- 9- 

4.3. Spectroscopic redshift surveys 

Spectroscopic galaxy redshift surveys in the optical 
and near-infrared represent the current state-of-the-art 
in large-scale structure observations (Percival 2014). The 
premise is simple: to detect redshifted emission lines 
from as many resolved sources as possible, over as large 
a spatial volume as possible, and then to reconstruct the 
3D distribution of sources in redshift-space. Assuming 
that the source population traces the underlying mat¬ 
ter density field, and samples it sufficiently well, one can 
then constrain the statistical properties of the cosmolog¬ 
ical matter distribution as a function of scale and red¬ 
shift. Because the positions and redshifts of the sources 
can both be measured with high precision, information 
about the matter distribution is retained in the angular 
and radial directions, unlike with more lossy techniques 
like intensity mapping and photometric redshifts. The 
downside is that taking high-resolution spectra for a large 
number of sources is extremely time-consuming. 

For ultra-large scales, we are primarily interested in 
beating-down sample variance by increasing the effec¬ 
tive survey volume. This requires a wide survey area, 
broad redshift coverage (preferentially at higher red¬ 
shifts, where longer wavelength Fourier modes are ac¬ 
cessible), and a sufficiently high signal intensity/source 
density that noise will be sub-dominant. Of the meth¬ 
ods considered in this paper, spectroscopic galaxy sur¬ 
veys appear the least well-suited to the task of sur¬ 
veying extremely large volumes. Spectroscopy is time- 
consuming, and while precision redshift information is 
extremely useful for studying baryon acoustic oscillations 
and other smaller-scale phenomena, it is less necessary 
for the largest scales, apart from allowing the survey to 
be split into a larger number of redshift bins (c.f. inten¬ 
sity mapping). Nevertheless, spectroscopic galaxy sur¬ 
veys are the most developed of the methods, and so have 
comparatively well-understood systematic effects. The 
additional small-scale information also allows them to 
measure basic cosmological parameters more accurately, 
which helps to break parameter degeneracies. 

We base the specihcations of our reference spectro¬ 
scopic survey on a large emission-line galaxy (ELG) 
survey along the same lines as Euclid (Laureijs et al. 
2011 ), a satellite mission with a near-infrared spectro¬ 
graph that will detect ~ 6 x 10^ Ha-emitting galaxies 
over 15,000 deg^ in the redshift range 0.65 z < 2.05. 
A similar ELG survey will be performed by DESI (for¬ 
merly BigBOSS), which will target [Oil] galaxies out to 
z = 1.7 over 14,000 deg^ (Schlegel et al. 2011). These 
are the largest planned spectroscopic surveys,^ but have 
the smallest area of the experiments considered here, and 
the lowest maximum redshift apart from LSST-red (see 
Section 4.4). 

^ A proposed HI galaxy survey with Phase 2 of the SKA would 
detect ~ 10® galaxies over ~30,000 deg® for redshifts 0 < z < 2 
(Yahya et al. 2015), but is considerably more futuristic. 
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4.3.1. Constraints on relativistic effects 

The most optimistic forecast for cgr, marginalizing 
over /nl only, yields cr(eGR) = 2.6 for the Ha survey’s 
narrowest redshift binning (Az = 0.025). This result is 
insensitive to bin width, increasing only slightly to 2.7 
for the widest binning (Az = 0.1); the additional in¬ 
formation gained by decreasing the bin width is mostly 
confined to small scales, where the relativistic effects are 
essentially negligible. The correlation between cgr and 
/nl is very weak, and there is no change in the con¬ 
straint whether /nl is marginalized or fixed. The survey 
is also quite close to its ideal (sample variance-limited) 
performance, with (T(eGR) improving only slightly to 2.2 
in the limit N( ^ 0 for Az = 0.025. These results, to¬ 
gether with the constraints corresponding to a cosmic 
variance-limited results (assuming Af = 0 and /sky = 1) 
are summarized in Table 3, and have a qualitatively sim¬ 
ilar behaviour. 

As with the two previous surveys, then, the relativis¬ 
tic effects are undetectable. This is despite the relatively 
high magnification bias of the Ha galaxies, which boosts 
the size of some of the relativistic correction terms. The 
Ha survey’s sky coverage and maximum redshift are 
smaller than for the other surveys though, which weak¬ 
ens its constraining power. An experiment with the same 
specifications as the Ha survey but covering twice the 
area (30,000 deg^) would give cr(eGR) = 1-8 (compared 
with 1.4 in the CV-limited case), which is still not enough 
to gain a detection. 

Nevertheless, the Ha surveys’s forecast constraint of 
o'(eGR) = 2.6 is the best so far, and the CV-limited fig¬ 
ure of 1.36 is markedly better than SKAl-MID’s value of 
1.97, despite the IM survey having a significantly wider 
redshift range. The enhanced performance of the spec¬ 
troscopic survey over intensity mapping is primarily due 
to the different behaviour of the bias functions, particu¬ 
larly s(z), which caused many of the relativistic effects 
to cancel for the IM survey. 

As before, we also forecast for the detectability of the 
large-scale leasing effect, parametrized by ewL- After re¬ 
stricting to modes i < 100 and marginalizing only over 
/nl, we find cr(ewL) = 0.19 for all three choices of red¬ 
shift bin width - a strong detection (see Table 3). 

4.3.2. Constraints on primordial non-Caussianity 

As with the previous two probes, our /nl forecasts 
include a Planck CMB prior and priors on the bias func¬ 
tions {Ab/b = 0.1, As = 1, A/evo = !)■ The results are 
shown in Table 3 for the Az = 0.025 redshift binning. 

The forecast constraint from the Ha survey is cr(/NL) = 
6.8, which is worse than the intensity mapping survey by 
a factor of ^ 2. While the Ha survey has a consistently 
higher bias (which enhances the non-Gaussian bias sig¬ 
nal, (X 5—1), it covers a narrower redshift range and 
smaller area than the IM survey, so ultimately loses out 
when the higher-redshift bins of the IM survey are taken 
into account (see Fig. 16, below). The difference in per¬ 
formance remains in the CV-limited case, again mostly 
due to the wider redshift range of the IM survey. 

The Ha constraint degrades only slightly to cr(/NL) = 
7.2 for the widest redshift binning, Az = 0.1. Simi¬ 
larly, it is only weakly sensitive to £max, improving from 
o'(/nl) = 7.2 for ^inax = 200 to 6.6 for £max = 1000 
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Fig. 10.— ID marginal errors on /nl for ^ Ho spectroscopic 
survey as a function of the prior on /evo- A Planck prior on the 
cosmological parameters is also included. 

(both for Az = 0.025). The addition of significantly more 
small-scale information in both the radial and transverse 
directions is therefore only mildly benehcial. 

As with the IM survey, there is a reasonably strong 
degeneracy between /nl and /evo, predominantly for the 
highest-redshift nuisance parameter bin. Fig. 10 shows 
the effect of changing the prior on /evo - an 0{1) prior 
is sufficient to completely break the degeneracy. The 
results are insensitive to the prior on the magnification 
bias, and there is no gain to be had from tightening the 
bias prior until a very low level of Ab/b < 1% is reached. 

4.3.3. Systematic uncertainties 

Spectroscopic surveys are prone to systematic effects 
on large angular scales. Redshift surveys commonly con¬ 
sist of samples from several non-contiguous fields, sur¬ 
veyed during different observing seasons and possibly 
even with different instruments, which can make it tricky 
to patch them together into a single coherent survey vol¬ 
ume. They may also suffer from the problem of not hav¬ 
ing a homogeneous magnitude limit, i.e. the magnitude 
cuts vary, and cannot easily be mapped onto a substan¬ 
tial and complete 3D volume of the sky. 

Galactic extinction is a dominant source of system¬ 
atic error on large angular scales. Dust in our galaxy 
changes the overall true flux cut of the survey, and in¬ 
troduces number density fluctuations that vary with the 
shape of the Galaxy. This must be corrected for (e.g. by 
fitting an extinction template) to avoid biasing the in¬ 
ferred large-scale power. Variations in airmass and see¬ 
ing also affect the number of photons reaching the de¬ 
tector in a way that is correlated with the elevation of 
the telescope. This induces additional dispersion in the 
magnitude of the measured galaxies, and the ability to 
distinguish them from stars. 

Stars themselves are problematic. For example, a small 
fraction of the observed sources may in fact be misiden- 
tified stars that contaminate the galaxy sample. Stars 
also obscure regions of the sky of order the size of the 
point-spread function, which reduces the observed den¬ 
sity of galaxies nearby. While this should be a small 
effect for an individual star, which will mask an area of 
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Fig. 11.— ID marginal errors on /nl and eQR as a function of the 
minimum usable spherical harmonic mode, ^mim foi" the Hck survey 
(Az = 0.025). For the /nl results, we have added a Planck prior 
to break degeneracies with other cosmological parameters, as well 
as priors on the bias functions {Ab/b = 0.1, As = 1, A/evo = !)• 

^ 10“® of a degree, the total obscured area can be sig¬ 
nificant given the large stellar density (which grows to¬ 
wards the galactic plane). In fact, stellar contamination 
was found to be a dominant source of systematic error in 
the recent analysis of the BOSS data (Ross et al. 2011), 
where it introduced a significant bias in the measure¬ 
ment of the correlation function on large angular scales if 
left uncorrected. This bias was well above the statistical 
uncertainty, to the extent that the correlation functions 
measured in the Northern and Southern Galactic hemi¬ 
spheres of the survey were inconsistent with one other. 

While many of these effects can lead to fluctuations in 
the number density of galaxies as a function of redshift, 
inducing systematics along the radial direction, the dom¬ 
inant effect is on large angular scales, reducing the effec¬ 
tive area of the survey and hampering accurate recovery 
of the lowest £ modes. Fig. 11 shows how the forecast 
constraints on egr and /nl depend on the minimum re¬ 
coverable £ mode of the survey. There is a rapid loss of 
information on both parameters as £min increases, with 
^■(/nl) doubling from 6.8 for £^i^ = 2 to around 13 at 
£min = 10. The degradation is similar for cr (egr), which 
also doubles in the same range. As such, future spectro¬ 
scopic surveys will likely need excellent control over large 
scale systematics if they are to be used to constrain /nl ■ 

4.4. Photometric redshift surveys 

One of the main drawbacks of spectroscopic surveys 
is the long integration times needed to resolve galaxy 
spectra sufficiently well to yield a good redshift estimate. 
Because of this, the number of targets selected for spec¬ 
troscopic follow-up is usually much smaller than the to¬ 
tal imaged sample, which significantly limits the survey 
depth and number density that can be achieved. 

In a photometric redshift survey, each galaxy that is 
detected with a sufficiently high signal-to-noise is imaged 
in a small number of wide frequency bands. This pro¬ 
vides a very coarse measurement of the galaxy’s spec¬ 
trum, convolved with the bandpass of each band, which 
can be used to statistically infer its redshift. These pho¬ 


tometric redshifts (usually abbreviated “photo-z’s”) have 
much larger uncertainties than their spectroscopic coun¬ 
terparts, and most of the information about gravitational 
clustering on radial scales is lost. Photo-zs can be re¬ 
covered for much fainter galaxies than spectroscopic red- 
shifts, however, and so photometric surveys have the abil¬ 
ity to cover significantly larger volumes. This potentially 
makes them more suitable for constraining cosmological 
observables on ultra-large scales. 

The first wide-area, deep photometric surveys are al¬ 
ready underway (Kaiser et al. 2002; Dark Energy Survey 
Collaboration 2005). Their results will pave the way for 
the Large Synoptic Survey Telescope (LSST, LSST Col¬ 
laboration et al. 2009), which will surpass them in terms 
of area, depth, and angular resolution (although the lat¬ 
ter is not a critical factor for this work). We have thus 
chosen to produce Fisher forecasts for LSST as the best 
photometric survey that we will have access to in the 
foreseeable future. 

We have assumed that the LSST will observe two sepa¬ 
rate galaxy populations: early-type (“red”) galaxies, and 
late-type (“blue”) galaxies. Although this is a simplis¬ 
tic picture, it allows us to study the effect that different 
properties of the sample will have on the final constraints: 

• Early-type galaxies form preferentially in high-density 
regions, and are associated with high-mass haloes. 
They are therefore more highly biased than blue galax¬ 
ies, which is desirable for measuring /nl- 

The number density of red galaxies decays very fast be¬ 
yond redshift z ~ 1. On the one hand, the drop should 
be associated with a larger evolution bias, which could 
enhance the amplitude of the relativistic terms. On 
the other, it limits the largest scales that the red 
sample can probe. Finally, the spectra of red galax¬ 
ies show prominent features, most importantly the 
4000A Balmer break. These features are easy to lo¬ 
cate, even using only photometric information, and 
therefore photo-z’s for red galaxies are more accurate 
on average. 

• Blue galaxies are found in lower density regions and 
correspond to lower-mass haloes. They are therefore 
more faithful tracers of the total matter density field, 
and have a lower bias, which impacts their usefulness 
for measuring /nl- LSST should be able to observe a 
significant number of blue galaxies up to much higher 
redshifts than the red population (z ~ 3) however, 
so their large-scale clustering properties can be stud¬ 
ied more accurately. Finally, photometric redshifts for 
blue galaxies will be more uncertain than those for red 
galaxies, as discussed above. 

We will provide forecasts for two samples: a sample of 
red galaxies only, labeled “red”, and a sample containing 
all of the galaxies observed by LSST (red -I- blue), which 
we will call the “full” sample. Although LSST will be 
able to detect galaxies down to a magnitude limit of 27.5 
in the r-band, it is not clear that the photometric redshift 
requirements will be satisfied for this survey depth. We 
have therefore adopted a more stringent magnitude cut 
of i < 25.3, corresponding to the so-called LSST “gold” 
sample (LSST Collaboration et al. 2009). We have fur¬ 
ther assumed that LSST will cover the whole southern 
hemisphere (/sky = 0.5). 
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prior assumed for /evo for the “red” and “full” samples. 

4.4.1. Constraints on relativistic effects 

As with the previous experiments, we start by explor¬ 
ing the possibility of detecting the contribution of the 
relativistic terms to the clustering of LSST galaxies in 
the best-case scenario, by marginalizing over only /nl 
whilst keeping all other cosmological parameters fixed to 
their fiducial values. Two main differences with respect 
to the previous tracers give some hope for detecting cgr 
with LSST. First of all, the sharp decay in the number 
density of red galaxies can enhance the amplitude of the 
relativistic terms thanks to the large value of /evo- Also, 
LSST covers a wider survey area and redshift range than 
spectroscopic surveys, so has access to larger scales. 

The results are summarized in Table 3. Even though 
the higher value of /evo for red galaxies helps to decrease 
the forecast uncertainty on cgr, it is still impossible to 
detect relativistic effects using either sample; cr(eGR) = 
1.4 and 2.3 for the red and full samples respectively. The 
red sample nevertheless produces the best constraint on 
CGR of any of the surveys considered above. 

As before, we predict the detectability of the large- 
scale magnification lensing by marginalizing only over 
/nl and using a small-scale cutoff fmax = 100. Our re¬ 
sults (see Table 3) show that this effect should be clearly 
detectable (well above 5a) for both the “red” and “full” 
samples. 

4.4.2. Constraints on primordial non-Caussianity 

Even though photometric redshifts erase most of the 
clustering signal on all but the largest radial scales, they 
are still sufhcient to enable a tomographic analysis of 
galaxy clustering to be performed. This helps immensely 
in breaking many of the degeneracies reported for con¬ 
tinuum surveys (which retain essentially no radial infor¬ 
mation) . 

We studied the importance of breaking these degen¬ 
eracies by again calculating ct(/nl) as a function of the 
priors on the bias parameters, finding that the forecasted 
uncertainty is almost completely insensitive to any priors 
on the clustering and magnification biases, b{z) and s{z). 
For the reasons outlined in Section 4.1.2, this behaviour 
does not follow for the evolution bias, so it is useful to 


explore the prior constraints on /evo that are required 
in order to optimize the measurement of /nl- Figure 
12 shows the dependence of cr(/NL) on a constant Gaus¬ 
sian prior imposed on /evo- The degeneracy between the 
parameters can be largely mitigated by measuring the 
evolution bias with an accuracy of A/evo ^ 1- 

As in the previous cases, we produced our final fore¬ 
casts for /nl by assuming Planck CMB priors for the cos¬ 
mological parameters, a 10% uncertainty on the cluster¬ 
ing bias, and priors of As = 1 and A/evo = 1- The final 
results are summarised in Table 3. LSST should be able 
to impose very tight constraints of cr(/NL) — 1-7 using 
galaxy clustering autocorrelations (single-tracer) alone. 

4.4.3. Systematic uncertainties 

Most of the sources of systematics that affect photo¬ 
metric redshift surveys are exactly the same as for their 
spectroscopic counterparts: galactic extinction, varia¬ 
tions in sky brightness, seeing, and stellar contamination 
(both due to stars affecting the local observed number 
density of galaxies, and stars erroneously being included 
in the galaxy sample). All of these effects can potentially 
contaminate the signal measured on large angular scales. 
Figure 13 shows the degradation in the constraints on 
/nl when the largest scales are omitted in the analysis. 
Limiting ourselves to scales £ > 10 would increase our 
best-case error bars by over 50%, to cr(/NL) 2.8. 

The use of photometric redshifts also carries its own 
systematic effects. In order to obtain a reliable estimate 
of the power spectrum of the galaxy density field that 
we can use to constrain large scales, it is necessary to 
have a sufficiently accurate model of the window func¬ 
tion for every redshift bin. Unless a spectroscopic sub¬ 
sample drawn from the same distribution as the photo¬ 
metric one is available, which is rarely the case, deriving 
a correct model for the true redshift distribution N{z) is 
a challenging task, although it has been noted that this 
issue could be ameliorated by cross-correlating the pho¬ 
tometric sample with any spectroscopic survey (Newman 
2008) . The presence of photometric redshift outliers can 
also modify the tails of the photo-z distribution, which 
affects the shape of the redshift window functions. The 
level to which this effect is problematic will depend on 
how accurately the photo-z pdf can be characterized. 

5. DISCUSSION 

It has been argued that general relativistic corrections 
to the number density of galaxies should be observable 
with future cosmological surveys. In particular, ultra 
large-scale features in the power spectrum of density fluc¬ 
tuations could in principle be detectable with deep and 
wide surveys such as those that will be carried out by 
the next generation of experiments. In this paper we set 
out to systematically examine this claim for the most 
relevant surveys planned for the next decade or so. At 
the same time, we performed forecasts for the expected 
constraints on the scale-dependent bias that arises from 
primordial non-Gaussianity, another key effect on ultra- 
large scales. Our analysis uses a more rigorous formalism 
than is usually followed, based on computing all possi¬ 
ble angular cross-correlations between different redshift 
bins instead of using an approximate 3-D power spec¬ 
trum approach, and avoiding the flat-sky and Limber 
approximations. 
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Experiment type 

Experiment 

irf/NL) 

irfeGR) 

o-fewL) 

Intensity mapping 

SKAl-MID 

3.01 

2.75 

— 


(w. 1.5x bias) 

0.90 

1.90 

— 


CV-limited 

1.68 

1.97 

— 

Continuum survey 

•Scut — 10//Jy 

18.5 

26.7 

1.90 


•Scut — 5p.Jy 

16.0 

24.6 

0.57 


•Scut — lp"Iy 

11.8 

17.1 

0.25 

Spectroscopic survey 

Ha survey 

6.64 

2.57 

0.19 


CV-limited 

3.02 

1.35 

0.10 

Photometric survey 

LSST-red 

4.32 

1.41 

0.14 


LSST-full 

1.71 

2.33 

0.04 


TABLE 3 

Forecasted constraints on /nl, egri and cwl for the different experiments explored in this work. 



Fig. 13. — Dependence of the uncertainty on /nl (solid lines) 
and egr (dashed lines) on the minimum scale probed by LSST for 
the two samples considered here. 


Our main conclusion is that, if we restrict ourselves to 
the single-tracer power spectrum of the density fluctu¬ 
ations, in either two or three dimensions, previously- 
undetected contributions to the power spectrum of source 
number counts are completely unobservable. Note that 
we have labelled these terms as ‘GR effects’ in this work, 
thus excluding the lensing magnification term, which we 
have treated separately due to its very different proper¬ 
ties (see below). 

In hindsight, this result is not surprising.® A number 
of papers have previously attempted to forecast the op¬ 
timal constraints on /nl from a variety of surveys, with 
the general conclusion being that, at best, one can de¬ 
tect a value of /nl ~ 1 — 2 at one sigma. Our work 
has confirmed these results. Given that egr has a simi¬ 
lar (although not identical) effect on the power spectrum 
as /nl ~ 1, we expect the same level of sensitivity to 
relativistic effects. If we are to aspire to a statistically 
significant detection of the relativistic effects, we would 
need a sensitivity of cr(eGR) ^ 0.1 — 0.2, which is clearly 

® See e.g. Challinor Lewis (20fl); Jeong et al. (2012); Yoo 
et al. (2012) for similar statements using less quantitative analyses. 


unachievable with any of the single-tracer survey tech¬ 
niques and strategies analysed here. 

It is interesting to look at each of the survey techniques 

in turn to see why they fall short of our desired target: 

• On the face of it, intensity mapping is a particularly 
promising approach to efficiently surveying large vol¬ 
umes of the Universe - it can simultaneously produce 
very deep surveys and cover large areas. One would 
expect this to be ideal for constraining both cgr and 
/nl- Relative to the other techniques, intensity map¬ 
ping suffers from the fact that one of the substantial 
corrections on large scales - the perturbation to an¬ 
gular distances - is absent. As was shown in Section 
2 .1, because one is measuring an intensity rather than 
source number density, there is an exact cancellation 
of the lensing contributions to both the number den¬ 
sity and angular diameter distance corrections. This 
has a significant impact on the size of the relativistic 
correction signal, and thus the detectability of egr- 

Intensity mapping can be used to obtain reasonably 
tight constraints on /nl, although there too it is placed 
at a disadvantage by the “Gaussian” clustering bias, 
b'^, for neutral hydrogen being smaller than for other 
types of probe. Note that while most halo-based mod¬ 
els predict a low HI bias, measurements of the cluster¬ 
ing of damped Lyman-a systems carried out by Font- 
Ribera et al. (2012) point towards a significantly higher 
value at z > 2. If we multiply our fiducial bias func¬ 
tion by a factor of 1.5x (so that 6 « 2 at z = 2.2), we 
obtain a significantly better constraint of cr(/NL) = 0-9 
for an IM survey going out to z = 3.5. Our forecasts 
for /nl are very sensitive to the fiducial bias model, 
and therefore could change significantly with better 
empirical measurements. 

We must also point out that the survey specifications 
we assumed are such that the measurements of the 
power spectra are only cosmic variance-limited on the 
very largest scales. It might be possible to improve 
the constraints on /nl (by up to 30% in the most op¬ 
timistic case) by reducing the noise (e.g. by increasing 
the survey time). 

• Radio continuum surveys^ while efficient at accessing 
large volumes, are remarkably poor at constraining 
both egr and /nl in a single-tracer context. The 
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Fig. 14.— Comoving volume and redshift range covered by the 
experiments considered in this work. 


loss of all radial information through projection over 
a wide redshift range has a significant toll on their 
ability to discriminate between different effects in the 
power spectrum. If we are to benefit from continuum 
surveys, a more sophisticated multi-tracer approach is 
needed, as we will discuss below. 

• We also looked at the ability of a spectroscopic survey 
to constrain large-scale features in the power spectra. 
The advantage of this type of survey is ostensibly their 
ability to obtain high-resolution measurements in both 
the angular and radial directions - more so than in any 
other type of survey considered here, although in prac¬ 
tice this comes at a price. Spectroscopic surveys are 
time-intensive, and so surveys are limited to smaller 
areas of the sky and shallower depths. The average 
number density of (usable) galaxies is also lower than 
for other surveys, and hence the effective noise on large 
angular scales can be more substantial. As was the 
case with intensity mapping, our forecasts for /nl de¬ 
pend crucially on the fiducial clustering bias, which for 
Ha emitters is also relatively low. 

• Deep and wide photometric surveys seem to be the 
most effective method for probing large scales, for the 
parameters we considered here. The loss of resolu¬ 
tion along the radial direction (as compared to a spec¬ 
troscopic survey) is compensated by the significantly 
wider redshift coverage and larger survey area. Photo¬ 
metric surveys are also a more rapid way of counting 
galaxies, so source number densities are higher than for 
their spectroscopic surveys contemporaries. Finally, 
the clustering bias for the sources that LSST will ob¬ 
serve is significantly higher than for HI and Ha galax¬ 
ies, boosting its ability to detect /nl substantially. 

Figure 14 compares the redshift range and comoving 
volume probed by the various experiments studied here. 
Measuring ultra large-scale observables depends criti¬ 
cally on the ability to cover very large volumes, but we 
have seen here that this is far from the only factor. Even 
though an SKAl continuum survey should be able to ac¬ 
cess the largest volume of any of the surveys, its inabil¬ 
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Fig. 15.— Regions in the space of transverse and parallel 
wavenumbers {k±, A:||) accessible for various surveys. Red: region 
accessible to a photometric redshift survey with ag = 0.05. The 
large A:||-regime is lost due to the inaccuracy of photo- 2 :’s. Blue: 
region accessible by an SKAl-MID IM survey in single-dish mode 
(Udish = 15m). Small transverse scales are inaccessible due to the 
beam width. White (containing red and blue regions): region ac¬ 
cessible to a full-sky spectroscopic survey covering up to 2 = 2. 
Contours: the relative amplitude of the contribution to the three- 
dimensional power spectrum due to primordial non-Gaussianity, 
with /nl = 10, for these three experiments, in black (spectro¬ 
scopic), red (photometric), and blue (intensity mapping). Note 
that this amplitude depends on the clustering bias of each tracer. 


ity to use radial information prevents it from achieving a 
competitive measurement of /nl without the use of the 
multi-tracer technique. 

It is therefore also relevant to compare the range of ra¬ 
dial and angular scales covered by each experiment. We 
do this in Figure 15 for intensity mapping, photometric, 
and spectroscopic surveys covering redshifts z < 2. The 
grey hashed region corresponds to inaccessible scales, ei¬ 
ther larger than the survey volume or smaller than the 
non-linear scale. While spectroscopic surveys have essen¬ 
tially complete access to the A:|| — k± plane, photometric 
surveys and intensity mapping are limited to only large 
radial or angular scales respectively. 

This is not necessarily an important inconvenience 
for ultra large-scale observables. The coloured contours 
show the relative amplitude of the /nl signal for each of 
these surveys (spectroscopic in black, photometric in red 
and IM in blue) for /nl = 10. In the large-fc region that 
only spectroscopic surveys have access to, the amplitude 
of the signal is relatively low. Note also that radial and 
angular scales should not be treated equally in this plot; 
simply by a dimensionality argument (two angular di¬ 
mensions vs. one radial), better angular coverage will 
usually be more advantageous. 

Finally, our ability to measure any signal depends crit¬ 
ically on its amplitude. In the case of primordial non- 
Gaussianity, this depends on the sample’s clustering bias. 
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Fig. 16.— Constraints on /nl a function of the maximum 
redshift covered by each of the surveys considered here. The co¬ 
moving volume per steradian at each z is indicated on the upper 
x-axis. 

since A6 ng oc (6—1). This dependence can be observed 
in Figure 16, where we show the forecasted uncertainty 
on /nl as a function of the maximum redshift covered by 
each survey. Even though intensity mapping has the po¬ 
tential to cover the largest volume while also preserving 
radial information, the improvement of cr(/NL) with z is 
signihcantly slower, in particular in the range z ~ 1 — 1.5, 
where the HI bias is very close to unity. 

We emphasise that all of our forecasts depend on a 
variety of astrophysical model assumptions. By this we 
mean that we have had to predict the number densi¬ 
ties and biases for the surveys as a function of redshift, 
based on existing data and simulations. Getting these 
assumptions correct is key for accurate forecasting, and 
substantial work will have to be done - for all types of 
surveys - to better estimate these quantities. Neverthe¬ 
less, we are confident that our models are sufficiently 
representative that our broad point is correct, and that 
the numbers we present here give a fair representation 
of what to expect from future surveys. On this point 
it is also worth noting that it is in principle possible to 
further constrain the level of primordial non-Gaussianity 
from measurements of higher-order correlations. Such 
measurements are extremely challenging at present due 
to the large theoretical uncertainties (e.g. in the form 
of the bias in the presence primordial non-Gaussianity, 
the form of the bispectrum and its covariance in redshift 
space, the value of the non-linear clustering bias etc.). 
Further studies of the three-point function might even¬ 
tually make such measurements possible, possibly super¬ 
seding the forecasts presented here. 

Conservatively, we did not include the lensing magnih- 
cation contribution to the power spectrum as one of the 
effects parametrized by cgr, even though it is a fully 
relativistic effect. This is because magnification has a 
significant amplitude on sub-horizon scales, and has in 
fact already been detected by cross-correlating pairs of 
distant tracers. This is qualitatively different to the situ¬ 
ation for the other relativistic terms, which are significant 


Fig. 17. — Constraints on ewL as a function of the prior on the 
magnification bias. The same parameters, and priors for the other 
bias functions, were used as in the /nl analysis, except for the 
continuum survey, where the clustering bias was not marginalized. 

only on ultra-large scales, and which have not been mea¬ 
sured yet in large-scale structure. Figure 1 indicates that 
the lensing magnification contribution to the power spec¬ 
trum can dominate the GR terms parametrized by cgr 
at ultra-large scales, however. This large-scale contribu¬ 
tion has not been detected in current small-volume sur¬ 
veys, but previous work has forecasted for its detectabil¬ 
ity (Yoo 2009; Namikawa et al. 2011; Yang & Zhang 2011; 
Yang et al. 2015; Montanari & Durrer 2015). By defin¬ 
ing a parameter, ewL; corresponding to the amplitude of 
the magnification lensing term alone, we have quantita¬ 
tively verified its detectability, showing that the large- 
scale lensing contribution should be detectable above 5cr 
in all relevant experiments. It has also been shown by 
Namikawa et al. (2011); Lopez-Honorez et al. (2012) and 
Camera et al. (2015a) that omission of the lensing -j- GR 
terms leads to a bias in the recovered value of /nl- This 
was predicted to be at the ~ 3a level for an SKA Phase 
2 HI spectroscopic survey (Camera et al. 2015a). Our 
analysis shows that cgr cannot be responsible for this 
bias, leading to the conclusion that it must be almost 
exclusively due to the ultra-large scale lensing term. 

We also tested the consistency of our results with the 
recent work by Montanari & Durrer (2015), who find 
^ 1% constraints on ewL (their parameter /?) for a Eu¬ 
clid photometric survey. Marginalising over the same 
set of cosmological parameters as them (except the neu¬ 
trino mass, and choosing t'max = 1000, we obtain 

o'(ewL) = 0.06 for the Ha spectroscopic survey with 
Az = 0.1 bins. This is consistent with their results to 
within a factor of a few, which is reasonable given the 
differences in survey specifications, fiducial magnification 
bias functions, and maximum £ used. 

As a further test we then repeated this analysis, but 
restricted ourselves to only large-scale modes, £ < 100, 
and marginalized over the bias nuisance parameters as 
well. While one would likely fix the bias functions to 
their fiducial forms when attempting a first detection of 
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the large-scale lensing effect, the bias uncertainties must 
be taken into account for precision measurements of ewL 
(e.g. if used to test GR, as discussed in Montanari & 
Durrer 2015). We find a strong correlation between ewL 
and the magnification bias, especially at higher redshifts, 
which is to be expected given that both factors multiply 
the lensing term in the number count expression. Fig. 17 
shows how the ewL constraint depends on the magnifica¬ 
tion bias prior for the various surveys; the spectroscopic 
and photometric surveys both require As 1 to measure 
ewL to better than 100%, and As of order a few xl0“^ 
to reach their optimal constraints. Continuum surveys 
require a stronger prior of ~ 0.1 just to reach a 100% 
constraint on ewL (for 1 and 5 /rJy flux limits), and are 
also subject to strong correlations with other bias pa¬ 
rameters, as discussed in Section 4.2.2. The continuum 
constraints shown in Fig. 17 were derived for fixed clus¬ 
tering bias, but if this is marginalized over (with a 10% 
relative prior on b), (T(ewL) increases by a factor of ^ 2. 

The fact that we are unable to detect cgr from the 
single-tracer power spectrum should not at all lead us 
to give up hope of seeing the GR corrections, however. 
Indeed, this is just the first step in identifying the most 
effective observables for teasing out the ultra large-scale 
effects. In order to beat down the cosmic variance that 
is a fundamental barrier to single-tracer detectability, 
one must use multi-tracer techniques. In some sense 
these divide out the stochastic part of the perturbation 
field, avoiding the effects of cosmic variance for certain 
(non-stochastic) quantities. By cross-correlating differ¬ 
ent tracers, with different bias functions, it is possible 


to isolate a number of terms from the scale dependence 
due to non-Gaussianity, relativistic effects, and growth 
of structure, in such a way as to obtain much tighter 
constraints than those from the overall power spectrum 
alone. It has been shown that multi-tracer techniques 
applied to continuum surveys can lead to almost an or¬ 
der of magnitude improvement in the detection of /nl, 
for example. We expect that the GR effects will also 
be detectable via this approach (Yoo et al. 2012). A 
systematic analysis of multi-tracer techniques in the sur¬ 
veys described here is, therefore, an obvious next step 
in trying to identify robust methods for measuring ultra 
large-scale effects. 

Note added — While this paper was being finalised, 
Raccanelli et al. (2015) appeared, which discusses some 
related topics. 
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APPENDIX 

A. MODIFICATIONS TO CLASS 

In order to compute the power spectra used for the Fisher forecasts, we used the public code CLASS (Lesgourgues 
2011). In its current version, CLASS encompasses the extension CLASSgal (Di Dio et al. 2013), which can be used to 
compute the transfer functions A* in Eqs. (39-44). The public version of the code is easy to install, run, and modify, 
and we encourage its use; however, a number of modifications had to be implemented in order to make it usable for 
our work. We have made our modified version of the code publicly available at http://intensitymapping .physics. 
ox.ac.uk/codes.html, and document the main changes here: 

• The terms A^, A^^ and A^®'^ in Eqs. (39-44) can be computationally very demanding to compute. The main 
reason for this is the wide redshift range covered by their window functions - these terms correspond to integrated 
effects along the photon trajectory from the source to the observer. In the current version of CLASS, including 
these terms in the calculation of for a single redshift bin at z = 0.5 takes about 10 minutes running on a 
modern 4-core computer. This makes including these terms for the large number of bins used in this project (e.g. 
100 bins for intensity mapping) prohibitively expensive, so we invested some time in speeding up the calculation 
of these terms. 

Two modifications were implemented. First of all, the code was parallelized for distributed memory machines 
using MPI, so that each node computes the transfer functions of a different set of redshift bins. Secondly, some 
redundant calculations were circumvented by precomputing the window functions and IU^®'^(77) 

in Eq. (45) and storing them in memory. This last modification speeds up the computation of the integrated 
terms by a factor of ^ 4, although it requires more memory. 

• We implemented the effect of primordial non-Gaussianity by including the scale-dependent contribution to the 
bias in Eq. (47). 

• We modified the I/O system for the bias parameters b{z), s(z) and fevoiz). These can now be supplied as tabu¬ 
lated z-dependent functions. Eurthermore, fevo must now be provided separately from the redshift distribution, 
iV(z). 

• We implemented the possibility of adding an extra parameter for each redshift bin that corresponds to the 
photo-z uncertainty, so that the window function in each bin can be computed as in Eq. (B26). In doing this, we 
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also modified the I/O system for defining the redshift bins. The bin properties must now be supplied in different 
columns in a separate text file. We believe this system is better suited for a large number of redshift bins. 

• Finally, we included the extra parameter cgr, used in this paper to parametrize the amplitude of the relativistic 
corrections. This is not a general-purpose modification. 


B. SURVEY SPECIFICATIONS 

In this Appendix we provide the detailed specifications for all four of our reference surveys. The codes used to 
generate the fiducial redshift distributions and bias functions have been made available online.® 


B.I. Intensity mapping 
B.l.I. Noise model and redshift binning 

For an IM autocorrelation experiment, the simplest case is to assume that the noise is uncorrelated between different 
frequency channels, and has a white noise power spectrum: 


= Sijff 


2 

sr ■ 


(Bl) 


Here is the noise variance per steradian, and can be calculated as follows: the noise per-pointing can be estimated 
as the rms temperature fluctuation of the system, TLys, scaled by the number of independent samples measured (given 
by 5vtp, where Sv is the frequency channel width and tp is the integration time per pointing), tp can be approximated 
by ftot Ar2/(47r/sky), where /sky is the surveyed fraction of the sky, Afl is the solid angle covered in each pointing, and 
ftot is the total survey time. Finally, scaling this by the total number of dishes in the experiment, we obtain the power 
spectrum 


. rsys(^i) 47r /sky 

ftot Adish 


(B2) 


Note that the variance per pointing has been multiplied by AH to obtain the variance per steradian, which cancels 
the dependence on AH. The system temperature receives two contributions, Tgys = Tsky + Tlnst, due to atmospheric 
and background radio emission (Tsky — 60K x (j^/300MHz)“^ ®) and instrumental noise (Tinst)- 
Finally, it is worth noting that two different conventions have been adopted in the literature regarding the effect of 
the beam that defines the angular resolution of the experiment. The difference is in interpreting the beam as smoothing 
the signal on scales beyond the resolution, or as enhancing the noise at those same scales. We use the former, so that 
the model for the total observed power spectrum is: 


Cf = B\ Bl + 


(B3) 


where B^ is the harmonic transform of the instrumental beam in the i—th frequency bin. We have assumed that the 

beams are Gaussian, B} = exp(—.^(£-|-l)0g/2), where 9b is related to the beam FWHM through 0fwhm = 2V21n20B. 
The beam width can be related to the dish diameter approximately as ^fwhm — c/(:^Tldish)- 
In our forecasts for SKAl-MID, we used the instrumental parameters Tinst = 25 AT, /sky = 0.75, ftotai = 10^ h, 
Tdish = 15 m, and Adish = 254, and assumed a minimum frequency of 350 MHz, corresponding to Zmax — 3. 

Modern radio receivers have very high frequency resolution (e.g. Sn ^ O.IMHz), so HI intensity mapping experi¬ 
ments should be able to resolve radial structures on scales much smaller than those relevant for cosmology. We are 
therefore free to choose the width and shape of the redshift bins used for the cosmological analysis. In order to avoid 
inhomogeneous coverage of radial scales, we divide the total frequency band into frequency bins of varying width Ai/(^) 
such that the corresponding comoving size Ay is held constant. We estimated the minimum number of frequency bins 
needed for the constraints on /nl and cgr to converge, finding that at least 100 bins were necessary. This corresponds 
to a radial width of Ay ~ 44Mpc//i. 


B.I.2. Nuisance parameters and redshift evolution 

We model the clustering, magnification, and evolution biases for HI using an approach based on the halo model. We 
first assume that a one-to-one relationship exists between halo mass and HI mass, Mhi = Mhi(M, 2;). The density 
and clustering bias can then be computed as 


Pm{z) 


hm{z) 




dMn{M, z) Mhi(M, z), 


IM, 




I dM n{M, z) b{M, z) , 

/M„i„ Pm{z) 


(B4) 

(B5) 


where n{M,z) is the halo mass function (comoving number density per unit mass), and b{M,z) is the halo bias. The 
background brightness temperature can then be computed in terms of pni using Eq. (26). 


http://intensitymapping.physics.ox.ac.uk/codes.html 





24 


As described in Section 2.1.2, the transverse distance perturbations cancel out for intensity mapping, so that shi(z) = 
2/5 exactly. Furthermore, since we observe the emission from all of the HI in each patch of the sky, the evolution bias 
can be computed directly by differentiating pni with respect to 2 , 


^ d log[pHi(^)] 

d log(l + z) ' 


(B6) 


All that remains is to specify the function Mhi(M, z). As in Bull et al. (2015), we assume a power-law relation 
Mhi(M, z) oc with an exponent a ~ 0.6, and with the normalization set by constraints on Hhi at z = 0.8 from 
Switzer et al. (2013). 


B.2. Radio continuum surveys 
B.2.1. Number counts, bias functions and noise model 

The models for the signal and noise power spectra are very simple compared with the other probes, as the galaxy 
sample is distributed in a single redshift bin, with a window function given by the redshift distribution, 

W{z)ocN{z). (B7) 

We estimate N{z) for the radio sources from empirical estimates of the luminosity functions of the main radio pop¬ 
ulations, since these also contain the necessary information to estimate the magnification and evolution biases. We 
consider four main radio galaxy types: star-forming galaxies (SF), starbursts (SB), radio-quiet quasars (RQQ), and 
Faranoff-Riley type I AGNs (FRI).' The luminosity functions for each population were computed following the pre¬ 
scriptions of Wihnan et al. (2008). 

We will now outline the procedure used to calculate the redshift distribution, s, and /evo in all cases, and refer the 
reader to Wihnan et al. (2008) and the references Yun et al. (2001) (SF and SB), Ueda et al. (2003) (RQQ), and 
Willott et al. (2001) (FRI) for details on the observations that the luminosity functions are based on. These details 
are also summarized in Appendix C. As with optical and IR surveys (see Sections 4.3 and 4.4), the fc-correction to the 
flux measured in a given band is also needed in order to accurately estimate the observed number counts. This can 
be done for radio sources by assuming a particular SED for each population. For this, we again used the models from 
Wilman et al. (2008). 

In its rest frame, a radio source has a luminosity per unit frequency given by 


^ dEe _ ^ (p{v) 

dtedVe ’ 


(B8) 


where is a pivot frequency and is the source SED. This is related to the flux per unit frequency measured by 
the observer by 


S, 


dEp 

dtpdvodAo 


Lv(l + z) 

4TTx'^iz){l + z) 


(1-2S±). 


(B9) 


Consider a radio survey in a given frequency band {v € [v^, I'f]). The average flux density measured for one source is 
defined as 





S. 


dv 

vf-Vo 


(BIO) 


A given source will be detected if its average flux is above the detection limit 5'cut, and therefore all sources with 
a pivot luminosity above a minimum value L//* = T//*(l + 25±) will be included in the sample, where the average 
threshold luminosity is 

rcut/ Q \ _ A'KScntX^{z){l + z)(p{v^) ^ 

and the average SED in the observed band is 


(p{vo{l + z),Vf{l + z)) 


ip{vi,V2) 



ip{v) 


dv 

V 2 -VI' 


(B12) 


Given a pivot frequency the luminosity function ns{z,\nL,^^) at that frequency, and a characteristic SED y}{v), 
the redshift distribution of sources can be computed as 


iV(z) 


(i + z)3Fr(z) 


■V(z,>z/f), 


(B13) 


where N is defined as in Eq. (8). s(z) and fevo{z) are then calculated from N using Eqs. (29) and (28). 


^ We also considered FRII galaxies, but their number density is 
so low that they contribute negligibly to the total number counts. 
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Fig. B1.— Angular number density of radio galaxies as a function of redshift for the combined continuum sample considered here, and 
for different flux cuts. 


For the clustering bias, we follow the same approach used in Wilman et al. (2008), and assign a fixed halo mass to 
each population. The corresponding bias is then found as the halo-model bias for that mass as a function of redshift. 
For this we parametrize the halo bias as in Sheth & Tormen (1999). We have also explored the approach followed in 
Ferramacho et al. (2014), where each population is given a distribution of halo masses rather than a fixed one, and 
the bias is found by averaging over that distribution. No significant differences were found between the approaches, 
so we use the first, simpler one. 

Once the redshift distribution and bias functions have been calculated for each population, we compute them for 
the combined sample as a weighed average of the individual ones, 


N,,tiz) = Y,Naiz), 

a 

(b, S, /evo)tot(^) = '^{b, S, fevo)aiz) , 

„ Ntotiz) 


(B14) 

(B15) 


where a labels the population. Separating the different populations is a very costly observational task. For our 
purposes, the main benefit of doing this is to allow the use of the multi-tracer technique to circumvent cosmic variance. 
Since we have postponed the multi-tracer analysis for future work, we will only report our forecasts here for the 
combined sample of radio sources. Figure B1 shows the redshift distribution for the combined sample for the three 
different detection limits considered here. The luminosity functions, redshift distributions, and bias can be obtained 
using a code that we have made publicly available.® This provides an easy way to obtain number counts and power 
spectra without needing to query the full simulation of Wilman et al. (2008). Our results are consistent with the 
simulation except for the total number counts of star-forming galaxies, where our numbers are a factor of 2.5 x higher. 
This is consistent with what is described in Jarvis et al. (2015), and references therein, however. 

The most relevant source of noise in a clustering analysis of discrete sources is shot noise, determined by the number 
density of sources. The angular number density of radio sources in the sample, Nq, is determined by integrating the 
redshift distribution 

pOO 

Nn= dzN{z), (B16) 

Jo 

and the noise power spectrum is given by 

N, = N^\ (B17) 


B.3. Spectroscopic redshift surveys 
B.3.1. Noise and signal model 

Spectroscopic galaxy surveys measure Eq. (11), the perturbation to flux-limited number counts AN( 2 :,n, > logFcut)- 
This depends on the clustering, magnification, and evolution bias functions for the source population. The latter two 
can be derived from the background luminosity function of the sources, ns{z, logL), given the flux limit and efficiency 
of the survey. Number counts are subject to a Poisson shot noise term that depends on the number density of sources, 
which can also be obtained from the luminosity function. 


http://intensitymapping.physics.ox.ac.uk/codes.html 


8 
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Fig. B2.— Predicted number density of galaxies as a function of redshift for the Ho spectroscopic survey. 


We follow the current set of public specifications for a realistic Her spectroscopic survey, described in Amendola 
et al. (2013). The number counts presented in Table 1.3 of that paper can be approximately reproduced by using the 
best-fit Ha Schechter luminosity function found by Geach et al. (2010), 


POO 

i{z,L>Lcut)= / e(j)* x°‘e~^ dx; x = L/L*{z), 


= 6:8 X 10" 


5.1x1041(1 + ^) (z<1.3) 


(1.3 < z < 2.2) 


erg/s, 


where L is the Ha line luminosity and we have assumed a flux limit of Feut = 3 x 10 i® erg/s/cm^, an efficiency of 
e = 0.45, a faint-end slope of a = —1.35, and comoving number density normalisation (jf = 1.37 x 10-3 Mpc-3. The 
predicted galaxy number density as a function of redshift is shown in Fig. B2. There are significant uncertainties in 
this model, which we account for by marginalizing over a set of bias function nuisance parameters, as explained in 
Sect. 3. Finally, for the clustering bias we use the simplified prescription from Amendola et al. (2013), b{z) = + z, 

which we also subject to the nuisance parametrisation. 

The assumed bias functions for the Ha survey were shown in Fig. 2. The evolution bias is large and negative, and 
grows more negative with redshift until z = 1.3, where there is a discontinuity in the L*{z) model, beyond which 
L*(z) = const, and so /evo = 0. This is not particularly realistic, but is the best that can be done until updated 
constraints on the high-redshift Ha luminosity function become available. The magnification bias grows rapidly with 
redshift, and deviates significantly from s = 0.4 over most of the range, meaning that there is little or no cancellation 
of the transverse scale perturbation as there was in the intensity mapping survey (although this is contingent on the 
uncertain behaviour of the luminosity function model for z > 1.3). The clustering bias is relatively close to unity 
across the entire redshift range, slowly evolving from a minimum of 6 « 1.3 at z = 0.65 to a maximum of 5 « 1.8 at 
z « 2. 

We consider three const ant-width redshift binnings for the selection function over the interval 0.65 < z < 2.05: 
Az = (0.025,0.05,0.10), yielding (56, 28, 14) bins respectively. The target redshift uncertainty for (e.g.) Euclid is 
o'z + 0.001 (1 -I- z) (Laureijs et al. 2011), which is always signihcantly smaller than the narrowest bin width. As such, 
we assume a uniform (tophat) selection function, weighted by the source redshift distribution, N{z). 

Our noise model assumes that only shot noise is relevant, and that effects such as spectroscopy failures and point 
source masking have been taken into account in the survey efficiency, e. The shot noise angular power spectrum is 


Nr 


^ij / 7 — 


N{z) dz. 


(B18) 


with rii measured in units of steradians-^. In our forecasts, we assumed that a wide range of multipoles can be 
recovered, 2 < i < 1000, with no cuts at high or low i due to systematics, non-linear effects and so on (this assumption 
was relaxed in Sect. 4.3.3). 


B.4. Photometric redshift surveys 
B.4.1. Redshift distribution and bias parameters 

In order to compute N{z), s(z), and fevo{z) for our two samples, we need an estimate of the luminosity function for 
both red and blue galaxies, preferably in the r-band, for which the LSST specifications are provided. We describe the 
method used for this task here. 
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For the red sample we follow a method similar to that used by Joachimi et al. (2011). First, an estimate of the 
B-band luminosity function for red galaxies is obtained from Faber et al. (2007) as a Schechter function with constant 
slope a = —0.5, and z-dependent 0* and M*, measured in a number of redshift bins in the interval z S (0.2,1.2). We 
extrapolate the luminosity function to higher/lower redshifts by fitting the values of these parameters, measured by 
Faber et al. (2007), to the models: 

M*(z) = Mo + Miz, (B19) 

<^4^) = Y+ [10”' (B20) 

with Mq = —20.6, Ml = —0.49, (/>o = 1.82, zq = 1.04 and a = 7.17. In order to translate this into an r-band luminosity 
function, we use B — r = 1.32 (Fukugita et al. 1995) and assume that B — r does not evolve significantly for the red 
sample in the redshift range under study. 

For the full sample, we use the r'-band luminosity function found by Gabasch et al. (2006), and approximate r' ~ r. 
This is again given as a Schechter function with constant slope a = —1.33 and z-dependent M* and <()*, for which we 
have used the following parametrizations: 

M*(z) = Mq + a ln(l + z), (B21) 

(t>*{z) = ((()o + </>i z + 4>2 z^) [10“^ Mpc“^] (B22) 


with Mq = —21.49, a = —1.25, tpo = 2.59, tpi = —0.136, and (j )2 = —0.081. The luminosity function for blue galaxies 
is then estimated as the difference between those of the full and red samples. 

An absolute magnitude, M, measured in a given rest-frame band for a galaxy at redshift z is related to the apparent 
magnitude in the observer-frame band, m, by 


M = m — 25 — 5 logiQ 


dL{z) 

1 Mpc h~^ 


-f logio/i- fc(z). 


(B23) 


where ^^(z) is the luminosity distance and k{z) is the ^-correction corresponding to that galaxy’s SED redshifted 
to z. We estimated fc(z) for red and blue galaxies by running the code kcorrect (Blanton & Roweis 2007) on the 
spectra of an elliptical galaxy and a barred spiral galaxy (Sbc) respectively, as measured by Coleman et al. (1980). 
We approximate and extrapolate these /c-corrections as fcred(^) 2.5 z and fcbiue ~ 1.5 z. We verified that these 
parametrizations are compatible with the /c-corrections shown in Fukugita et al. (1995) for both types, and also that 
the final redshift distributions did not vary significantly when other functional forms are used. For a given magnitude 
limit rricutj "we use Eq. (B23) to compute the corresponding luminosity cut Lcut as a function of redshift. The redshift 
distribution, magnification bias, and evolution bias for each population are then estimated using Eqs. (30), (28), and 
(29) respectively, with the luminosity functions described above. 

Regarding the clustering bias, for the full sample we use the parametrization bfuiiiz) = 1-1- 0.84z, based on the 
simulations of Weinberg et al. (2004) and quoted in the LSST science book (LSST Collaboration et al. 2009). Red 
galaxies should have a larger bias, which we parametrize as 5red('2) ~ 1 -|- z. This parametrization is compatible 
with bias measurements at redshifts z < 1 (e.g. Coil et al. 2008). Since the red population dies off at z ^ 1.4, this 
extrapolation should not significantly influence our final result, especially as we ultimately marginalize over b{z). 

The redshift distributions for the two samples considered here are shown in Figure B3. For this figure, as well as in 
our forecasts, we assume a magnitude limit oii = 25.3, corresponding to r ^ 26 for typical galaxy colours, as quoted in 
LSST Collaboration et al. (2009). According to the models used here, the total number of galaxies observed by LSST 
should be ~40 per arcmin^, in qualitative agreement with previous results (Ilbert et al. 2006; LSST Collaboration 
et al. 2009). 


B.4.2. Photometric redshifts, binning and shot noise 

The quality of a given photo-z algorithm is normally quoted in terms of its rms error, = ((zp^oto ~ Arue)^)) which 
is typically parametrized as 

cTziz) =ao{l + z). (B24) 

The photometric redshift requirement for the LSST gold sample, as quoted in LSST Collaboration et al. (2009), 
is (To < 0.05, with a goal of 0.02. As described above, since the spectral properties of red galaxies make their 
photometric redshifts more accurate than those of the blue population, we have assumed the following photometric 
redshift uncertainties for the two samples: 

(T™'^ = 0.02, (t“1 = 0.05. (B25) 

We have also assumed that the data will be analysed by dividing the sample into a number of photo-z bins. Let Zq 
and Zj be the limits of the i-th bin. The window function in this bin must trace the true z distribution of galaxies within 
it, and is therefore given by the product of the overall redshift distribution and the photo-z probability distribution, 
integrated over the bin: 

W\z) oc N{z) w\z), 


(B26) 
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Fig. B3.— Angular number density of galaxies as a function of redshift for the LSST “red” (left plot) and “full” (right plot) samples. 
The colored lines show the window functions of the different redshift bins considered here. 


/*4 

w\z) = / dzpp{zp\z), 


(B27) 


where p{zp\z) is the photo- 2 : pdf. Assuming a Gaussian photo -2 distribution, we can write the window functions w’'(z) 
analytically, as 


vf'{z) 



(B28) 


The tails of w'‘{z) correlate different redshift bins to a much larger degree than the intrinsic correlations due to 
gravitational clustering do, which is an expression of the loss of information on radial scales. In order to reduce this 
correlation and avoid redundant calculations, the width of the redshift bins are usually defined to be of the order of 
tTj. In this work we have chosen to define the width of our bins to be three times the photo -2 dispersion at the bin 
centre. For the values of CTz assumed here, this results in 15 bins for the red sample and 9 bins for the full sample. 
The window functions for the bins are shown in Figure B3. 

Finally, as in the case of spectroscopic and continuum surveys, the main source of statistical noise in the measurement 
of clustering anisotropies is shot noise. The noise bias term for photometric surveys is thus given by 

iVf = 4^, (B29) 

where 

(B30) 
(B31) 

Note that this reduces to Eq. (BI8) in the case of top-hat windows (i.e. —>■ 0). 


dzN{z)w\z), 


C. LUMINOSITY FUNCTION OF RADIO SOURCES 

We will summarize here the steps that were followed to compute the luminosity functions of the different radio 
sources that would be observable with a continuum survey. We follow almost exactly what was done by Wilman et al. 
(2008) to simulate the distribution of radio galaxies, but we would like to describe the details of the calculation here 
for the benefit of the potential users of the codes used in this paper. 

We will discuss five main galaxy populations: star-forming galaxies, starbursts, radio-quiet AGNs and radio-loud 
AGNs of types ERI and ERIE We note that we use the convention for the luminosity function used in Section 2.1, 
where 71 ^( 2 , L) is the physical (not comoving) number density of objects per unit InL. 


C.l. Star-forming galaxies and starbursts 

The luminosity functions of normal star-forming galaxies and starbursts used here are based on the luminosity 
function derived from the IRAS 2 pjy sample by Yun et al. (2001) at 1.4 GHz. This is modelled as a sum of two 
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Schechter functions, which we identify with two populations: normal and starburst galaxies respectively. Thus, for 
both normal star-forming galaxies and starbursts, the z = 0, 1.4 GHz luminosity function is parametrized as 

(Cl) 


where in each case: 


n* = <1 

^3.2 X 10 "‘Mpc ^ for normal galaxies 
^8.3 X 10“® Mpc“^ for starbursts 

(C2) 

^ 

f 2.10 X 10^^ [W/Hz/sr] for normal galaxies 
[1.44 X 10^^ [W/Hz/sr] for starbursts 

(C3) 

and a = —0.633 in both cases. This parametrization is valid for luminosities above logj^p-^/[W/Hz/sr] = 19.6, and it 
is assumed to be constant for lower values of L, in agreement with Mauch & Sadler (20(17). 

In agreement with Rowan-Robinson et al. (1993), we assume a pure luminosity evolution of this luminosity function. 


ns{z, L) = (1 -t zf ns{z = 0,L f{z)), 

(C4) 


, /(I + for z < Zo 

^^^>~\{l + Zo)^-^ iOY Z>Zo ’ 

(C5) 


ns{z = 0, L) = n* 


exp 


with zo = 1.5. This parametrization was obtained assuming a cosmological model = (1,0). We adapt it 

to our fiducial cosmology by scaling the number densities and luminosities by the ratios of comoving volumes and 
luminosity distancies in both models, 


n^P{z,L\n,) = 


-^(2) 


z, L 


X(2)W' 


X(2)(2:) H(^i){z) 


(C6) 


where the indices (1) and (2) indicate quantities computed in two different cosmological models. 

The SED for both normal and starburst galaxies was assumed to be 

(fiv) oc (1 — e ^) (l + lOr'GH®^) (C7) 

with r'GHz = 1^/(1 GHz) and r = (j^*/j^ghz)^'^i with = 0.005 for normal galaxies and iz* = 1 for starbursts. This 
corresponds to a combination of thermal free-free emission and non-thermal synchrotron from supernovae. We did not 
include a dust component in this SED, as mentioned in Wilman et al. (2008), which should be irrelevant for the range 
of redshifts and frequencies studied here. 


C.2. Radio-loud AGNs 

The luminosity function for FRI and FRH radio-loud AGNs was based on model “C” of the luminosity function 
at 151 MHz derived by Willott et al. (2001). This luminosity function consists of low-luminosity and high-luminosity 
components, which we identify with FRI and FRH sources respectively. 

For FRI galaxies, the z = 0 luminosity function takes the form of a Schechter function 


ns{z = 0, L) 



f L" 

Q. 

\ 

r L 1 

In 10 


j exp 



(C8) 


with n* = 10 (Mpc) log^o T*/(l W/Hz/sr) = 26.1 and a = —0.539. The model also assumes a pure density 
evolution, 

nsiz,L) = zf f{z)nsiz = 0,L) (C9) 


- /(I + ^ < ^0 

’ \{l + zoy-^ for z>0o 


(CIO) 


where Zq = 0.706. 

For FRH galaxies, the luminosity function takes the form of a Schechter function with an inverted exponential term. 


with n* = 10-6-196 I 
assumed, with 


^-3 


n* f LY 


-InloUJ 

~~L 

) = 26.95, and a = - 

-2.27. 

O 

1 





(Gil) 


(G12) 
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0.559 for z < zo 
1.378 for z > Zo 


cr*{z) = 


(C13) 


where Zq = 1.91. As in the case of star-forming galaxies, these luminosity functions were derived for an Einstein-de 
Sitter background, so had to be adapted to our fiducial cosmology using Eq. (C 6 ). 

A power-law SED with (p{v) oc was assumed for both types of radio-loud AGN. 


C.3. Radio-quiet AGN 

Radio-quiet AGN make up the majority of the total AGN population, as is observed from the hard X-ray luminosity 
function. This can be combined with the relation between hard X-ray and radio luminosities (Brinkmann et al. 2000), 

logio(A 2 -iokev/(ergs“^)) = 1.012 logio(Ai.4GHz/(W/Hz/sr)) -b 21.3, (C14) 

to derive the 1.4 GHz luminosity function. 

Eor this we use the X-ray luminosity function of Ueda et al. (2003) in the 2 — 10 keV band, parametrized at z = 0 as 

n,(z = 0 , Lx) = ^ Px/A*)^^ + (Lx/A*)'^^]-', (C15) 


where Lx is the X-ray luminosity, A = 5.04 x 10 ®Mpc 71 = 0.86, 72 = 2.23, and log 4 Q(L*/(ergs ^)) = 43.94. 
The evolution with redshift is parametrized as a luminosity-dependent density evolution, 

ns(z, Lx) = (1 + z)^ f{z, Lx) ns{z = 0 , Lx), (C16) 


with 


f{z,Lx) 


(1 -b for z < Zo 

(1 + Zo)4-23 for z>Zo, 


where Zq is a luminosity-dependent function 


zo{Lx) 


ri.9(Lx/Ai)0-335 for Ax<Ai 

\l.9 for Ax>Ai, 


(C17) 


(C18) 


with logio(Ai/(ergs i)) = 44.6. 

A power-law SED with q}{v) oc was assumed for radio-quiet AGN. 
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